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his text is intended for a one-semester course in the calculus of functions of 

several variables and vector analysis, which is normally taught at the sophomore 
level. In addition to making changes and improvements throughout the text, in this new 
edition we have added considerable material that presents the historical development 
of the subject and have also attempted to convey a sense of excitement, relevance, 
and importance of the subject matter. 


Prerequisites 


Sometimes courses in vector calculus are preceded by a first course in linear alge- 
bra, but this is not an essential prerequisite. We require only the bare rudiments of 
matrix algebra, and the necessary concepts are developed in the text. If this course 
is preceded by a course in linear algebra, the instructor will have no difficulty en- 
hancing the material. However, we do assume a knowledge of the fundamentals of 
one-variable calculus—the process of differentiation and integration and their geo- 
metric and physical meaning as well as a knowledge of the standard functions, such 
as the trigonometric and exponential functions. 


The Role of Theory 


The text includes much of the basic theory as well as many concrete examples 
and problems. Some of the technical proofs for theorems in Chapters 2 and 5 
are given in optional sections that are readily available on the book's Web site at 
www.whfreeman.com/MarsdenVCS5e (see the description on the next page). Section 
2.2, on limits and continuity, is designed to be treated lightly and is deliberately brief. 
More sophisticated theoretical topics, such as compactness and delicate proofs in in- 
tegration theory, have been omitted, because they usually belong to a more advanced 
course in real analysis. 


Concrete and Student-Oriented 


Computational skills and intuitive understanding are important at this level, and we 
have tried to meet this need by making the book concrete and student-oriented. For 
example, although we formulate the definition of the derivative correctly, it is done by 
using matrices of partial derivatives rather than abstract linear transformations. We 
also include a number of physical illustrations such as fluid mechanics, gravitation, 
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and electromagnetic theory, and from economics as well, although knowledge of these 
subjects is not assumed. 


Order of Topics 


A special feature of the text is the early introduction of vector fields, divergence, and 
curl in Chapter 4, before integration. Vector analysis often suffers in a course of this 
type, and the present arrangement is designed to offset this tendency. To go even fur- 
ther, one might consider teaching Chapter 3 (Taylor’s theorems, maxima and minima, 
Lagrange multipliers) after Chapter 8 (the integral theorems of vector analysis). 

This fifth edition was completely reset, but retains and improves on the balance 
between theory, applications, optional material, and historical notes that was present 
in earlier editions. 


Supplements 


One of the main changes in this edition is in the supplement. They are as follows: 


1. Web Site. The book’s Web site contains the following materials: 


e Internet Supplement, a PDF file containing additional material suitable for 
projects as well as technical proofs and sample examinations with complete 
solutions. 

* PowerPoint and KeyNote Slides for instructors to use in presentations of the 
text's figures, as well as section-by-section summaries. 

* LaTeX and PDF Files of Sample Exams (on instructor's protected site) 

* Updates 


It is available to everyone and can be found at www.whfreeman.com/ 
MarsdenVCSe. 


2. Student Study Guide with Solutions. This student guide, written by Karen Pao 
and Fred Soon, contains helpful hints and summaries for the material in each 
section, contains the solutions to selected problems, and contains sample exams 
to help students in exam preparation. Problems whose solutions appear in the 
Student Study Guide have a colored number in the text, for easy reference. The 
guide has been revised and reset for the Fifth Edition of Vector Calculus. ISBN 
0-7167-0528-1 

3. Instructor’s Manual with Solutions. This supplement contains material 
available only to instructors. This includes summaries of material and additional 
worked-out examples that are helpful in the preparation of lectures. It also 
contains additional solutions to problems and sample exams (some of them with 
complete solutions). ISBN 0-7167-0646-6 


Final Exam Questions 

There are practice exams available in the Student Study Guide, the Internet supple- 
ment, as well as in the Instructor’s Manual. We also include some final exam ques- 
tions (some of them challenging) for the reader’s convenience on the book’s Web site. 
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Of course, the level and choice of topics and the lengths of final exams will vary from 
instructor to instructor. Working these problems requires a knowledge of most of the 
main material of the book, and solving 10 of these problems should take the reader 
about 3 hours to complete. Some solutions are also given on the book's Web site. 

We are excited about this new edition of Vector Calculus, especially the inclusion 
ofthe new historical material as well as the new discussions of interesting applications 
of vector analysis, both mathematical and physical. We hope that the reader will be 
equally pleased. 


Jerry Marsden and Tony Tromba, 
Caltech and UC Santa Cruz, Summer 2003. 
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Metin? Stute 


A Brief Account 


This, therefore, is Mathematics; she reminds you of the invisible form of 
the soul; she gives life to her own discoveries; she awakens the mind and 
purifies the intellect; she brings light to our intrinsic ideas; she abolishes 
oblivion and ignorance which is ours by birth. 


fProclus, c. 450 


Cum Deus Calculat Fit Mundus. 
(As God calculates, so the world is created). 


Leibniz, c. 1700 


he word mathematics derives from the Greek word mathema, meaning knowl- 

edge, cognition, understanding, or perception, suggesting that the study of what 
we now call mathematics began by asking questions about the world. In fact, the his- 
torical evidence suggests that mathematics began about 2700 years ago as an attempt 
to comprehend nature. Unfortunately, in most mathematical expositions, historical 
motivations and contexts are often sacrificed. In this new edition, the authors con- 
tinue to address this problem by increasing the discussion of historical and contextual 
material where appropriate. Therefore, before we dive into the mathematics of Vector 
Calculus, we briefly discuss the development of mathematics prior to and including 
the discovery of calculus. 


Egyptian, Babylonian, and Greek Mathematics 


It is generally acknowledged that mathematics developed in the seventh and sixth 
centuries B.C., somewhat after the Greeks had developed a uniform alphabet. This is not 
to say, however, that mathematical knowledge did not exist before the Greeks. In fact, 
the Egyptians and Babylonians knew many empirical facts centuries before the rise 
ofthe Greek civilization. For example, they could solve quadratic equations, compute 
the areas of certain geometric figures, such as squares, rectangles, and triangles, and 
they possessed a reasonably good formula for the area of a circle, using the value of 
3.16 for z. They also knew how to compute certain volumes like the size of cubes, 
rectangles, rectangular solids, cones, cylinders, and (not surprisingly) pyramids. The 
ancients were also acquainted with the pythagorean theorem (at least empirically). 
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Historical Introduction 


The Greeks, who settled throughout the Mediterranean, must have played an 
important role in preserving and spreading the mathematical knowledge of the Egyp- 
tians and the Babylonians. However, the Greeks were aware that there were different 
formulas for the same area or volumes. For example, the Babylonians had one formula 
for the volume of a frustum of a pyramid with a square base, and the Egyptians had 
another (see Figure 1). 


Figure 1 Volume of a frustum of a pyramid with a 
square base: V — ih(a? +ab 4- b^). 


It is not surprising that the Egyptians (with the experience in pyramid construc- 
tion) had the correct formula. Now, given two formulas, it was clear that only one 
could be correct. But how could one decide such an answer? Certainly it is not a 
question for debate, as would be the question of the quality of works of art. It is 
likely that the necessity to determine the answers to such questions is what led to the 
development of mathematical proof and to the method of deductive reasoning. 

The person usually credited for the invention of rigorous mathematical proof 
was a merchant named Thales of Miletus (624—548 B.C.). It is Thales who is said 
to be the creator of Greek geometry, and it was this geometry (earth measure) as an 
abstract mathematical theory (rather than a collection of empirical facts) supported 
by rigorous deductive proofs that was one of the turning points of scientific thinking. 
It led to the creation of the first mathematical model for physical phenomena. 

For example, one of the most beautiful geometric theories developed during 
antiquity was that of conic sections. See Figure 2. 

Conics include the straight line, circle, ellipse, parabola, and hyperbola. Their 
discovery is attributed to Menaechmus, a member of the school of the great Greek 
philosopher Plato. Plato, a student of Socrates, founded his school The Academy (see 
Figure 3) in a sacred area of the ancient city of Athens, called Hekadameia (after 
the hero Hekademos). All later academies obtained their name from this institution, 
which existed without interruption for about 1000 years until it was dissolved by the 
Roman Emperor Justinian in A.D. 529. 

Plato suggested the following problem to his students: 


Explain the motion of the heavenly bodies by some geometrical theory. 


Why was this a question of interest and puzzlement for the Greeks? Observed 
from the Earth, these motions appear to be quite complicated. The motions of the 
sun and the moon can be roughly described as circular with constant speed, but 
the deviations from the circular orbit were troublesome to the Greeks and they felt 
challenged to find an explanation for these irregularities. The observed orbits of the 


Historical Introduction XV 


Figure 2 The conic sections: (A) hyperbola, 
(B) parabola, (C) ellipse, (D) circle. 


planets are even more complicated, because as they go through a revolution, they 
appear to reverse direction several times. 

The Greeks sought to understand this apparently wild motion by means of 
their geometry. Eudoxus, Hipparchus, and then Apollonius of Perga (262-190 B.C.) 


Figure 3 Plato’s Academy (mosaic 
found in Pompeii, Villa of T. Siminius 
Stephanus, 86 x 85 cm, Naples, 
Archaeological Museum). With certainty 
the seven men have been identified as 
Plato (third from the left) and six other 
philosophers, who are talking about the 
universe, the celestial spheres, and the 
stars. The mosaic shows Plato’s Academy, 
with the city of Athens in the background. 
It is probably a copy (from the first 
century B.C.) of a Hellenistic painting. 
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suggested that the celestial orbits could be explained by combinations of circular mo- 
tion (that is, through the construction of curves called epicycles traced out by circles 
moving on other circles). This idea was to become the most important astronomical 
theory of the next two thousand years. This theory, known by us through the writings 
of the Greek astronomer Ptolemy of Alexandria, ultimately becomes known as the 
“Ptolemaic theory.” See Figures 4 and 5. 
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Figure 4 Woodcut from Georg von Peurbach’s 
Theoricae novae planetarum, edited by Oronce Fine as a 
teaching text for the University of Paris (1515). It was the 
canonical description of the heavens until the end of the 
sixteenth century, and even Copernicus was to a large 
extent under the influence of this work. Peurbach 
described the solid sphere representations of Ptolemaic 
planetary models, which he probably based on Ibn 
al-Haytham's work “On the configuration of the world” 
(translated into Latin in the thirteenth century). The same 
frontispiece was used for the Sacrosbosco edition of the 
first four books of Euclid's Elements (in excerpts), which 
appeared under the title Textus de Sphaera in Paris 
(1521). 


Figure5 Ptolemy observing the stars with a quadrant, 
together with an allegoric Astronomia. (From Gregorius 
Reish, Margarita Philosophica nova, Strasbourg, 1512, 
an early compendium of philosophy and science.) In 
those days, Ptolemy was often depicted as a king, because 
he was erroneously thought to be descended from the 
Ptolemaic dynasty that ruled Egypt after Alexander. 
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Most of Greek geometry was codified by Euclid in his Elements (of Mathematics). 
Actually the Elements consist of thirteen books, in which Euclid collected most of 
the mathematical knowledge of his age (circa 300 B.c.), transforming it into a lucid, 
logically developed masterpiece. In addition to the Elements, some of Euclid's other 
writings were also handed down to us, including his Optics and the Catoptrica (theory 
of mirrors). 

The success of Greek mathematics had a profound effect on views of nature. 
The Platonists, or followers of Plato, distinguished between the world of ideas and 
the world of physical objects. Plato was the first to propose that ultimate truth or 
understanding could not come from the material world, which is constantly subject to 
change, but only from mathematical models or constructs. Thus, infallible knowledge 
could be attained only through mathematics. Plato not only wished to use mathematics 
in the study of nature, but he actually went so far as to attempt to substitute mathematics 
for nature. For Plato, reality lies only within the realm of ideas, especially mathematical 
ideas. 

Not everyone in antiquity agreed with this point of view. Aristotle, a student of 
Plato, criticized Plato's reduction of science to the study of mathematics. Aristotle 
thought that the study of the material world was one's primary source of reality. 
Despite Aristotle's critique, the view that mathematical laws governed the universe 
took a firm hold on classical thought. The search for the mathematical laws of nature 
was underway. 

After the death of Archimedes in 212 B.c., Greek civilization went into a period of 
slow decline. The final blow to Greek civilization came in 640 A.D. with the Moslem 
conquest of Egypt. The remaining Greek texts housed in the great library in Alexandria 
were burned. Those scholars who survived migrated to Constantinople (now part of 
Turkey), which had become the capitol of the Eastern Roman Empire. It was in this 
great city that what survived of Greek civilization was preserved for its rediscovery 
by European civilization some five hundred years later. 


Indian and Arabian Mathematics 


Mathematical activity did not, however, cease with the decline of Greek civilization. In 
the middle of the sixth century, somewhere in the Ganges Valley in India, our modern 
system of numeration evolved. The Indians developed a number system based on ten, 
with ten rather abstract symbols from zero to nine looking “roughly” as they do today. 
They developed rules for addition, multiplication, and division (as we have today), a 
system infinitely superior to the Roman abacus, which was used (by a special class 
of servants called arithmeticians) throughout Europe until the fifteenth century. See 
Figure 6. 

After the fall of Egypt, came the rise of Arab civilization centered in Baghdad. 
Scholars from Constantinople and India were invited to study and to share their 
knowledge. It was through these contacts that the Arabs came to acquire the learning of 
the ancients as well as the newly discovered Indian system of numeration. See Figure 7. 

It was the Arabs who gave us the name Algebra, which comes from the book by 
the astronomer Mohammed ibn Musa al-Khuwarizmi titled “Al-Jabr w’al muqabola;" 
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Figure 6 Arithmetician performing 
a calculation on a counter-abacus. 


which means “restoring” or “balancing” (equations). Al-Khuwarizmi is also respon- 
sible for a second profoundly influential book entitled “Kitab al jami’ wa’l tafriq bi 
hisab al hind" (Indian Technique of Addition and Subtraction), which described and 
clarified the Indian decimal place value system. 

Al-Khuwarizmi also gave us another name for a fundamental branch of science, 
the word algorithm. Latinized, his name became Algorism, then Algorismus, and 
finally Algorithm. The term initially represented the Indian system of numeration, 
but ultimately came to be used in its modern computational sense. 

The decline of Arab civilization coincided with the rise of European civilization. 
The dawn of the modern age began when Richard the Lionhearted reached the walls 
of Jerusalem. From approximately 1192 through around 1270, the Christian knights 
brought the learning of the “infidels” back to Europe. Around 1200-1205, Leonardo 
of Pisa (also known as Fibonacci), who had traveled extensively in Africa and Asia 
Minor, wrote his interpretation (in Latin) of Arabic and Greek mathematics. His 
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Figure 7 Detail from the Codex Vigilanus (976 A.D. northern Spain). The 
first known occurrence of the nine Indo-Arabic numerals in Western Europe. 
(Escurial Library, Madrid.) 
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historic texts brought the work of al-Khuwarizmi and Euclid to the attention of a 
large audience in Europe. 


European Mathematics 


Around 1450 Johann Gutenberg invented the printing press with movable type. This, 
combined with the advent of linen and cotton paper obtained from the Chinese, 
dramatically increased the rate of the dissemination of knowledge. The steep rise in 
trade and manufacturing fueled the growth of wealth and dramatic change in European 
societies from feudal to city-states. In Italy, the mother of the Renaissance, we see the 
rise of extraordinarily wealthy states such as Venice under the Doges and Florence 
under the Medicis. 

The needs of the rising merchant class accelerated the adoption of the Indian 
system of numeration. The teachings ofthe Catholic Church, which rested on absolute 
authority and dogma, began to be challenged by the ideas of Plato. From Plato, scholars 
learned that the world was rational and could be understood, and that the means of 
understanding nature was through mathematics. But this sharply contradicted the 
teachings of the church, which taught that God designed the universe. The only 
possible resolution of this apparent contradiction was that “God designed the universe 
mathematically” or that “God is a mathematician.” 

It is perhaps surprising how much this point of view inspired the work of many 
sixteenth- to eighteenth-century mathematicians and scientists. For if this were indeed 
the case, then by understanding the mathematical laws of the universe, one could come 
closer to an understanding of the Creator himself. Believe it or not, this point of view 
survives to this day. The following is a quote from Paul Dirac, a Nobel Prize-winning 
physicist and a creator of modern quantum mechanics. 


It seems to be one of the fundamental features of nature that fundamental 
physical laws are described in terms of a mathematical theory of great beauty 
and power, needing quite a high standard of mathematics for one to understand 
it. You may wonder: Why is nature constructed along these lines? One can only 
answer that our present knowledge seems to show that nature is so constructed. 
We simply have to accept it. One could perhaps describe the situation by saying 
that God is a mathematician of a very high order, and He used very advanced 
mathematics in constructing the universe. Our feeble attempts at mathematics 
enable us to understand a bit of the universe, and as we proceed to develop higher 
and higher mathematics we can hope to understand the universe better. 


Mathematics began to see further advances and applications. In the sixteenth and 
seventeenth centuries, al-Khuwarizmi’s algebra was significantly advanced by Car- 
dano, Vieta, and Descartes. The Babylonians had solved the quadratic equation, but 
now two thousand years later, del Ferro and Tartaglia solved the cubic equation, 
which in turn led to the discovery of imaginary numbers. These imaginary numbers 
were later to play a fundamental role, as we shall see, in the development of vector 
calculus. In the early seventeenth century, Descartes, perhaps motivated by the grid 
technique used by Italian fresco painters to locate points on a wall or canvas, created, 
in a moment of great mathematical inspiration, coordinate (or analytic) geometry. 
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This new mathematical model enables one to reduce Euclid's geometry to algebra 
and provides a precise and quantitative method to describe and calculate with space 
curves and surfaces. 

Early on, Archimedes' great work in statics and equilibrium (centers of gravity, 
the principle of the lever—which we study in this book) was absorbed and improved 
upon, leading to dramatic engineering achievements. In a building spree that remains 
astonishing to this day, engineering advances made possible the rise of an incredible 
number of cathedrals throughout Europe, including the stunning Duomo in Florence, 
Notre Dame in Paris, and the Great Cathedral in Cologne, to mention a few. See 
Figure 8. 


Figure 8 Duomo. 


Filippo Brunelleschi (1377-1446) studied the works of Euclid and Hipparchus and 
was the first artist to employ mathematics extensively. The mathematical principles of 
perspective were eventually completed by Piero della Francesca (1410—1492). Math- 
ematicians and engineers were recruited by warring princes to fuel the development 
of advanced weapons and ballistic science. The most famous among these was none 
other than Leonardo da Vinci, who in the last years of his life was employed by the 
Duke of Milan. It was in these final years that he painted the “Mona Lisa,” now housed 
in the Louvre in Paris. See Figure 9. 

However, as in Greek times, it was astronomy that was to give mathematics its 
greatest impetus. It is not surprising that the Greek astronomers placed the Earth and 
not the sun at the center of our universe, because on a daily basis we see the sun 
both rise and set. Still, it is interesting to ask if the Greeks, who were such marvelous 
thinkers, at least tested the heliocentric theory, which places the sun at the center ofthe 
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Figure 9 Leonardo, self portrait. 


universe. In fact, they did. In the third century B.c., Aristarchus of Samus taught that 
the Earth and other planets move in circular orbits around a fixed sun. His hypotheses 
were, for several reasons, rejected. First, the opposing astronomers reasoned that if 
the Earth were indeed moving, one should be able to sense it. Second, how would 
objects, circulating with us, be able to stay on a moving Earth? Third, why are the 
clouds not lagging behind the moving Earth? 

Such arguments were to be used again in the sixteenth century against the Polish 
astronomer Nicolas Copernicus (see Figure 10), who in 1543 introduced the helio- 
centric theory (the planets move in orbit around the sun). His book Revolutionibus 
Orbium Coelestium (On the Revolution of the Heavenly Orbits) was to initiate the 
“Copernican revolution" in science and to give the world a new word, revolutionary. 

In 1619, the German astronomer Johannes Kepler (see Figure 11), using the astro- 
nomical calculations ofthe Danish astronomer Tycho Brahe, showed that the planetary 


Figure 10 Nicolaus Copernicus (1473-1543). 
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orbits were in fact elliptical, the same ellipses that the Greeks had studied as abstract 
forms some 2000 years earlier (see Figure 12). 

But Kepler's law of elliptical orbits was only one of three laws he discovered 
governing planetary motion. Kepler's second law states that if a planet moves from 
a point A to another point B in a certain amount of time T , and also moves from A^ 
to B' in the same time, and if S is a focus of the orbital ellipse, then the sections SAB 
are SA'B' have equal areas (see Figure 13). Kepler's third law was that the square of 
time T a planetary body requires to complete an orbit is proportional to a°, where a 
is the great axis of the elliptical orbit. In equation form, T? = Ka?, where K is some 
constant (we shall derive this law for circular orbits in Chapter 4). 
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Figure 11 Johannes Kepler Figure 12 The motion of Mars. From 
(1571-1630). Kepler’s Astronomia Nova (1609). 
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Figure 13 Kepler’s second law. 


Profound as these observations were, an explanation of why these laws held was 
lacking. However, by the middle of the seventeenth century, it was fully understood that 
a change of velocity requires the action of forces, but how these forces influenced mo- 
tion was not at all clear. In 1674 Robert Hooke, in an attempt to explain Kepler's laws, 
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assumed the existence of an attractive force the sun must exert on the planets, a force 
that decreased with planetary distance. Hooke’s theory, however, was only qualitative. 


Newton 

What was also seriously lacking was a quantitative, precise definition of both velocity 
and acceleration. This was ultimately solved by the invention of calculus by both 
Isaac Newton and Gottfried Wilhelm Leibniz (see Figure 14). Hooke was never able 
to achieve an understanding of the profound ideas behind the infinitesimal calculus. 
However, during the period of 1679-1680 Hooke discussed his ideas with Newton, 
including his conjecture that the force the sun exerts on the planets was actually 
inversely proportional to the square of the planetary distance. 


Figure 14 Gottfried Wilhelm Leibniz (1646-1716). 


After Sir Christopher Wren, amateur astronomer, architect of the city of London 
and London's magnificent St. Paul's Cathedral, issued a public challenge to “theo- 
retically determine" the orbits of the planets, Isaac Newton took a serious interest in 
the problem. Perhaps acting on rumors, the great British astronomer Edmund Halley 
(1656—1743) in August 1684 visited Newton in Cambridge and asked him directly 
what the orbit of a planet would be under an inverse square force. Newton answered 
that it had to be an ellipse. As the stunned Halley asked him how he knew this, New- 
ton’s famous reply was “Why I have calculated it^ Halley ultimately urged Newton to 
publish his results as a book, and these appeared in 1686 in Newton's now legendary 
Principia. See Figure 15. 

This book, often and justly referred to as the foundation of modern science, had 
an immediate dramatic impact. Alexander Pope wrote: 


Nature and nature's laws lay hid at night, 
God said, "Let Newton be" and all was light. 


On the front cover of this text, we see Newton holding open a copy of his Principia. 
Although Newton did not use calculus in the Principia, convincing arguments 
have been put forward that Newton originally used his calculus to derive the 
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Figure 15 The frontispiece of the 
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Bibliopolas,” which is sometimes called 


M AT H E M A T I C A. the "first issue" of the first edition. The 


“export copy" (with the three lines 
“Prostant Venales apud Sam Smith... 


Autore FS. NEWTON, Trin. Coll. Cantab. Soc. Mathefeos aliosq; nonnullos Bibliopolas”) is called 
Profeffore Lucafiano, & Societatis Regalis Sodali. the second issue of the first edition. This 


distinction between the first and second 


Jula Societatis Regie ac Typis Fofephi Streater. Proftat apud 
plores Bibliopolas. Æo MDCLXXXVIL. 


IMPRIMATUR issues seems to be quite unfounded. It has 
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been suggested that Halley made an 
agreement with Smith concerning foreign 
sales; in fact, most of Smith’s fifty copies 
were apparently sold on the continent. 


jui 5. 1686. 
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trajectories of the planetary orbits from the inverse square law.* The Principia pro- 
vided profound evidence that the universe, as the early Greeks had understood, was 
indeed designed mathematically. Incidentally, it was Newton who first conceptualized 
force as a vector, although he provided no formal definition of what a vector was. 
Such a formal definition had to wait for William Rowan Hamilton, a century and a 
half after the Principia. It was for this achievement and his creation of calculus itself 
that we chose Newton for our cover. 

The invention of the calculus and the subsequent development of vector calcu- 
lus was the true beginning of modern science and technology, which has changed 
our world so dramatically. From the mathematics of Newton's mechanics to the pro- 
found intellectual constructs of Maxwell's electrodynamics, Einstein's relativity, and 
Heisenberg's and Schródinger's quantum mechanics, we have seen the discoveries 
of radio, television, wireless communications, flight, computers, space travel, and 
countless engineering marvels. 

Underlying all these developments was mathematics, an exciting adventure of the 
mind and a celebration of the human spirit. It is in this context that we begin our 
account of Vector calculus. 


* We shall study the problem of planetary orbits in Section 4.1 and further in the Internet supplement. 
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We assume that students have studied the calculus of functions of a real variable, 
including analytic geometry in the plane. Some students may have had some exposure 
to matrices as well, although what we shall need is given in Sections 1.3 and 1.5. 

We also assume that students are familiar with functions of elementary calculus, 
such as sin x, cosx, e*, and logx (we write logx or Inx for the natural logarithm, 
which is sometimes denoted log, x). Students are expected to know, or to review as the 
course proceeds, the basic rules of differentiation and integration for functions of one 
variable, such as the chain rule, the quotient rule, integration by parts, and so forth. 

We now summarize the notations to be used later. Students can read through these 
quickly now, then refer to them later if the need arises. 

The collection of all real numbers is denoted R. Thus R includes the integers, . . . , 
—3, —2, —1,0, 1, 2,3, ...; the rational numbers, p/q, where p and q are integers 
(q £ 0); and the irrational numbers, such as 4/2, x, and e. Members of R may be 
visualized as points on the real-number line, as shown in Figure P.1. 


-3 -2 -1 0 +1 2 2 e 3m 


Figure P.1 The geometric representation of points on the real-number line. 


When we write a € IR we mean that a is a member of the set R, in other words, that 
a is a real number. Given two real numbers a and b witha < b (that is, with a less than 
b), we can form the closed interval [a, b], consisting of all x such that a < x < b, 
and the open interval (a, b), consisting of all x such that a < x < b. Similarly, we 
can form half-open intervals (a, b] and [a, b) (Figure P2). 


a b c d e y 
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Figure P.2 The geometric representation of the intervals [a, b], (c, d), and [e, f). 
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Prerequisites and Notation 


The absolute value of a number a € R is written |a| and is defined as 


a ifa>0 
—a ifa « 0. 


lal ={ 


For example, |3| = 3, |-3| = 3, |0| = 0, and |—6| = 6. The inequality |a + b| < 
la| + |b| always holds. The distance from a to b is given by |a — b|. Thus, the 
distance from 6 to 10 is 4 and from —6 to 3 is 9. 

If we write A C R, we mean A is a subset of R. For example, A could equal the set 
of integers {..., —3, —2, —1, 0, 1, 2, 3, . . .}. Another example of a subset of R is the 
set Q of rational numbers. Generally, for two collections of objects (that is, sets) A 
and B, A C B means A is a subset of B; that is, every member of A is also a member 
of B. 

The symbol A U B means the union of A and B, the collection whose members 
are members of either A or B (or both). Thus, 


{...,—3, 2, 1,0] U (7,0, 1,2,...] = (..., 73, 22, 71,0, 1,2, .. .. 


Similarly, AM B means the intersection of A and B; that is, this set consists of those 
members of A and B that are in both A and B. Thus, the intersection of the two sets 
above is (—1, 0]. 

We shall write A\B for those members of A that are not in B. Thus, 


{...,-3, 22, -1,0N(71,0,1,2, ...] = ÉL... 73, —2}. 
We can also specify sets as in the following examples: 


{a eR | a is an integer} = (..., 3, 2 21,0, 1,2, ...] 
{a € R | a is an even integer] = (..., —2, 0,2, 4,...} 


{x €R|a<x <b} =[a, bd]. 


A function f: A — B is a rule that assigns to each a € A one specific member f(a) 
of B. We call A the domain of f and B the target of f. The set (f(x) | x € A} 
consisting of all the values of f(x) is called the range of f. Denoted by f(A), the 
range is a subset of the target B. It may be all of B, in which case f is said to be 
onto B. The fact that the function f sends a to f(a) is denoted by a +> f(a). For 
example, the function f(x) — x?/(1 — x) that assigns the number x?/(1 — x) to each 
x # 1 in R can also be defined by the rule x  x?/(1 — x). Functions are also called 
mappings, maps, or transformations. The notation f: A C IR — IR means that A is 
a subset of IR and that f assigns a value f(x) in R to each x € A. The graph of f 
consists of all the points (x, f(x)) in the plane (Figure P.3). 
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Graph of f 
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Figure P.3 The graph of a function with the 
half-open interval A as domain. 
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A = domain 


The notation 7. , a; means a, +--+ + an, where aj, ..., a, are given numbers. 
The sum of the first n integers is 


n 
1 
142+--tn= ist ) 


i=l 
The derivative of a function f(x) is denoted f'(x), or 


af 
dx 


and the definite integral is written 


b 
[ f(x) dx. 
a 
If we set y = f(x), the derivative is also denoted by 


dy 
dx’ 


Readers are assumed to be familiar with the chain rule, integration by parts, and 
other basic facts from the calculus of functions of one variable. In particular, they 
should know how to differentiate and integrate exponential, logarithmic, and trigono- 
metric functions. Short tables of derivatives and integrals, which are adequate for the 
needs of this text, are printed at the front and back of the book. 

The following notations are used synonymously: e* = expx, Inx = logx, and 
sin! x = arcsin x. 

The end of a proof is denoted by the symbol m, while the end of an example or 
remark is denoted by the symbol a. 
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Euclidean Space 


Quaternions came from Hamilton . . . and have been an unmixed evil to 
those who have touched them in any way. Vector is a useless survival . . . 
and has never been of the slightest use to any creature. 


Lord Kelvin 


n this chapter we consider the basic operations on vectors in two- and three- 

dimensional space: vector addition, scalar multiplication, and the dot and cross 
products. In Section 1.5 we generalize some of these notions to n-space and review 
properties of matrices that will be needed in Chapters 2 and 3. 


1.1 Vectors in Two- and Three-Dimensional Space 


Points P in the plane are represented by ordered pairs of real numbers (ai, a2); the 
numbers a, and a» are called the Cartesian coordinates of P. We draw two perpen- 
dicular lines, label them as the x and y axes, and then drop perpendiculars from P to 
these axes, as in Figure 1.1.1. After designating the intersection of the x and y axes 
as the origin and choosing units on these axes, we produce two signed distances a; 
and a» as shown in the figure; a; is called the x component of P, and az is called the 
y component. 

Points in space may be similarly represented as ordered triples of real numbers. 
To construct such a representation, we choose three mutually perpendicular lines 
that meet at a point in space. These lines are called x axis, y axis, and z axis, and the 
point at which they meet is called the origin (this is our reference point). We choose 
a scale on these axes, as shown in Figure 1.1.2. 
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Figure 1.1.1 Cartesian coordinates in the plane. 


Figure 1.1.2 Cartesian coordinates in space. 


The triple (0, 0, 0) corresponds to the origin of the coordinate system, and the 
arrows on the axes indicate the positive directions. For example, the triple (2, 4, 4) 
represents a point 2 units from the origin in the positive direction along the x axis, 
4 units in the positive direction along the y axis, and 4 units in the positive direction 
along the z axis (Figure 1.1.3). 


Figure 1.1.3 Geometric representation of the point (2, 4, 4) 
in Cartesian coordinates. 


Because we can associate points in space with ordered triples in this way, we often 
use the expression “the point (ai, a2, a3)” instead of the longer phrase “the point P 
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that corresponds to the triple (ai, a2, a3).” We say that a; is the x coordinate (or first 
coordinate), a» is the y coordinate (or second coordinate), and a3 is the z coordinate 
(or third coordinate) of P. It is also common to denote points in space with the letters 
x, y, andz in place of a), a», and a3. Thus, the triple (x, y, z) represents a point whose 
first coordinate is x, second coordinate is y, and third coordinate is z. 

We employ the following notation for the line, the plane, and three-dimensional 
space: 


(i) The real number line is denoted R! or simply IR. 
(ii) The set of all ordered pairs (x, y) of real numbers is denoted R?. 
(iii) The set of all ordered triples (x, y, z) of real numbers is denoted R?. 


When speaking of R!, R2, and IR? simultaneously, we write IR", where n = 1,2, 
or 3; or R”, where m = 1, 2,3. Starting in Section 1.5 we will also study R” for 
n = 4,5,6, ..., but the cases n = 1, 2,3 are closest to our geometric intuition and 
will be stressed throughout the book. 


Vector Addition and Scalar Multiplication 


The operation of addition can be extended from IR to R? and IR?. For R?, this is done 
as follows. Given the two triples (a), a2, a3) and (bi, b2, b3), we define their sum to be 


(a1, a2, 43) + (bi, b2, b3) = (a1 + bi, a2 + b2, a3 + bs). 


EXAMPLE 1 


(1, 1, 1) + (2, ~3, 4) = (3, —2, 5), 
(x,y,z) + (0, 0, 0) = (x, y, z), 
(1,7,3)+(@,b,c)=(1+4,7+b,3+0). a 


The element (0, 0, 0) is called the zero element (or just zero) of IR?. The element 
(—ai, —a2, —a3) is the additive inverse (or negative) of (ai, a2, a3), and we will write 
(ai, a2, 43) — (bi, b2, b3) for (a1, a2, a3) + (—bi, —b2, —b3). 

The additive inverse, when added to the vector itself, of course produces zero: 


(a1, a2, a3) + (—a1, —a2, —a3) = (0, 0, 0). 


There are several important product operations that we will define on R?. One 
of these, called the inner product, assigns a real number to each pair of elements of 
T2. We shall discuss it in detail in Section 1.2. Another product operation for IR? is 
called scalar multiplication (the word “scalar” is a synonym for “real number"). This 
product combines scalars (real numbers) and elements of R? (ordered triples) to yield 
elements of IR? as follows: Given a scalar œ and a triple (a), a2, a3), we define the 
scalar multiple by 


(4), az, a3) = (aa), ap, a3). 
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EXAMPLE 2 


2(4, e, 1) = (2 -4,2 - e, 2 - 1) = (8, 2e, 2), 
6(1, 1, 1) = (6, 6, 6), 
l(u, v, w) = (u, v, w), 


0(p.q,r) = (0,0,0). 4 


Addition and scalar multiplication of triples satisfy the following properties: 


G) (@B)(a1, a2, a3) = o [B(ai, a2, a3)] (associativity) 
Gi) (@ + B)(ai, a2, a3) = a (a1, a2, a3) + (a1, a2, a3) (distributivity) 
(iii) o (a1, a2, a3) + (bı, b2, b3)] = æ(a1, a2, a3) + (bi, b2, b3) (distributivity) 
(iv) o(0, 0, 0) = (0, 0, 0) (property of zero) 
(v) O(a, a2, a3) = (0, 0, 0) (property of zero) 
(vi) 1(a1, a2, a3) = (a1, a2, a3) (property of the 


unit element) 
The identities are proven directly from the definitions of addition and scalar 
multiplication. For instance, 
(œ + BY(ai, a2, a3) = ((a@ + B)ai, (x + B)az, (a + B)a3) 
= (aa; + Bay, aa» + Ba», as + Bas) 
= aai, a2, a3) + B(a1, a2, a3). 


For R?, addition and scalar multiplication are defined just as in R?, with the third 
component of each vector dropped off. All the properties (i) to (vi) still hold. 


AIJE] Interpret the chemical equation 2NH? + H2 = 2NH; as a relation 
in the algebra of ordered pairs. 


SOLUTION We think of the molecule N.H, (x atoms of nitrogen, y atoms of 
hydrogen) as represented by the ordered pair (x, y). Then the chemical equation given 
is equivalent to 2(1, 2) + (0, 2) = 2(1, 3). Indeed, both sides are equal to (2, 6). A 


Geometry of Vector Operations 


Let us turn to the geometry of these operations in R? and R°. For the moment, we define 
a vector to be a directed line segment beginning at the origin, that is, a line segment 
with specified magnitude and direction, and initial point at the origin. Figure 1.1.4 
shows several vectors, drawn as arrows beginning at the origin. In print, vectors are 
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usually denoted by boldface letters such as a. By hand, we usually write them as a or 
simply as a, possibly with a line or wavy line under it. 


Figure 1.1.4 Geometrically, vectors are thought of as 


arrows emanating from the origin. 
y 


Using this definition of a vector, we associate with each vector a the point 
(a1, a2, a3) where a terminates, and conversely, we can associate a vector a with 
each point (a), a2, a3) in space. Thus, we shall identify a with (a), a2, a3) and write 
a = (a1, a2, a3). For this reason, the elements of R? not only are ordered triples of 
real numbers, but are also regarded as vectors. The triple (0, 0, 0) is denoted 0. 
We call a;, a2, and a3 the components of a, or when we think of a as a point, its 
coordinates. 

Two vectors a = (ai, a2, a3) and b = (bi, b2, b3) are equal if and only ifa, = by, 
a = bz, and a3 = b3. Geometrically this means that a and b have the same direction 
and the same length (or “magnitude”). 

Geometrically, we define vector addition as follows. In the plane containing the 
vectors a = (a1, a2, a3) and b = (bi, b2, b3) (see Figure 1.1.5), form the parallelogram 
having a as one side and b as its adjacent side. The sum a + b is the directed line 
segment along the diagonal of the parallelogram. 


[ae / Figure 1.1.5 The geometry of vector addition. 
a y 


This geometric view of vector addition is useful in many physical situations, as 
we shall see in the next section. For an easily visualized example, consider a bird or 
an airplane flying through the air with velocity v;, but in the presence of a wind with 
velocity v2. The resultant velocity, v; + v2, is what one sees; see Figure 1.1.6. 
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Figure 1.1.6 A physical interpretation of vector addition. 


To show that our geometric definition of addition is consistent with our alge- 
braic definition, we demonstrate that a + b = (a; + bi, a2 + b2, a3 + b3). We shall 
prove this result in the plane and leave the proof in three-dimensional space to the 
reader. Thus, we wish to show that if a = (a1, a2) and b = (bj, b2), then a+b = 
(a1 + bi, a2 + b2). 

In Figure 1.1.7 let a = (a1, a2) be the vector ending at the point A, and let 
b = (bi, b2) be the vector ending at point B. By definition, the vector a + b ends 
at the vertex C of parallelogram OBCA. To verify that a + b = (ai + bi, a2 + b2), 
it suffices to show that the coordinates of C are (a; + bi, a2 + b2). The sides of 
the triangles OAD and BCG are parallel, and the sides OA and BC have equal 
lengths, which we write as OA = BC. These triangles are congruent, so BG = OD; 
since BGFE is a rectangle, EF = BG. Furthermore, OD = a, and OE = b;. Hence, 
EF = BG = OD = a,. Since OF = EF + OE, it follows that OF = a, + bı. This 
shows that the x coordinate of a + b is a; + bi. The proof that the y coordinate 
is a + b; is analogous. This argument assumes A and B to be in the first quadrant, 
but similar arguments hold for the other quadrants. 


A= (ay, a2) 


Figure 1.1.7 The construction used to prove that 
(a1, a2) + (bi, b2) = (ai + bi, a2 + b2). 


Figure 1.1.8(a) illustrates another way of looking at vector addition: in terms 
of triangles rather than parallelograms. That is, we translate (without rotation) the 
directed line segment representing the vector b so that it begins at the end of the 
vector a. The endpoint of the resulting directed segment is the endpoint of the vector 
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a + b. We note that when a and b are collinear, the triangle collapses to a line segment, 
as in Figure 1.1.8(b). 


y y 


(a) (b) 


Figure 1.1.8 (a) Vector addition may be visualized in terms of triangles as well 
as parallelograms. (b) The triangle collapses to a line segment when a and b are 
collinear. 


In Figure 1.1.8 we have placed a and b head to tail. That is, the tail of b is placed 
at the head of a, and the vector a + b goes from the tail of a to the head of b. If we do 
it in the other order, b 4- a, we get the same vector by going around the parallelogram 
the other way. Consistent with this figure, it is useful to let vectors “glide” or “slide,” 
keeping the same magnitude and direction. We want, in fact, to regard two vectors 
as the same if they have the same magnitude and direction. When we insist on vec- 
tors beginning at the origin, we will say that we have bound vectors. If we allow 
vectors to begin at other points, we will speak of free vectors or just vectors. 


Vectors Vectors (also called free vectors) are directed line segments in [the 
plane or] space represented by directed line segments with a beginning (tail) 
and an end (head). Directed line segments obtained from each other by parallel 
translation (but not rotation) represent the same vector. 

The components (di, a2, a3) of a are the (signed) lengths of the projections 


of a along the three coordinate axes; equivalently, they are defined by placing 
the tail of a at the origin and letting the head be the point (a), a2, a3). We write 
a = (a1, a5, a3). 

Two vectors are added by placing them head to tail and drawing the vectors 
from the tail of the first to the head of the second, as in Figure 1.1.8. 


Scalar multiplication of vectors also has a geometric interpretation. If œ is a scalar 
and a a vector, we define wa to be the vector that is |o| times as long as a, with the 
same direction as a if œ > 0, but with the opposite direction if œ < 0. Figure 1.1.9 
illustrates several examples. 
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»- 


Nile 


Figure 1.1.9 Some scalar multiples of a vector a. 


Using an argument based on similar triangles, one finds that if a = (a, a2, a3), 
and o is a scalar, then 


aa = (aa), (a5, &a3). 


That is, the geometric definition coincides with the algebraic one. 

Given two vectors a and b, how do we represent the vector b — a geometrically, 
that is, what is the geometry of vector subtraction? Because a + (b — a) = b, we 
see that b — a is the vector that one adds to a to get b. In view of this, we may 
conclude that b — a is the vector parallel to, and with the same magnitude as, the 
directed line segment beginning at the endpoint of a and terminating at the endpoint 
of b when a and b begin at the same point (see Figure 1.1.10). 


Figure 1.1.10 The geometry of vector subtraction. 


PGW thas) Letu and v be the vectors shown in Figure 1.1.11. Draw the two 
vectors u 4- v and —2u. What are their components? 
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Figure 1.1.11 Find u + v and —2u. 


SOLUTION Place the tail of v at the tip of u to obtain the vector shown in 
Figure 1.1.12. 


Figure 1.1.12 Computing u 4- v and 
—2u. 


The vector —2u, also shown, has length twice that of u and points in the opposite 
direction. From the figure, we see that the vector u + v has components (5, 2) and 
—2u has components (—6, —4). A 


| EXAMPLE 5 


(a) Sketch —2v, where v has components (—1, 1, 2). 
(b) Ifv and w are any two vectors, show that v — iw and 3v — w are parallel. 
SOLUTION 


(a) The vector —2v is twice as long as v, but points in the opposite direction (see 
Figure 1.1.13). 


(b) v-— $w = 4(3v — w); vectors that are multiples of one another are parallel. A 
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(-1, 1,2) 


Figure 1.1.13 Multiplying (—1, 1, 2) 
by —2. 


(2,-2,-4) 


'The Standard Basis Vectors 


To describe vectors in space, it is convenient to introduce three special vectors along 
the x, y, and z axes: 


i: the vector with components (1, 0, 0) 
j: the vector with components (0, 1, 0) 


k: the vector with components (0, 0, 1). 


These standard basis vectors are illustrated in Figure 1.1.14. In the plane one has the 
standard basis i and j with components (1, 0) and (0, 1). 


Zz 


n 0,1) 


Figure 1.1.14 The standard basis vectors. 


Let a be any vector, and let (ai, a2, a3) be its components. Then 
a = aji + ajj + ask, 
because the right-hand side is given in components by 
ai(1, 0, 0) + a2(0, 1, 0) + a3(0, 0, 1) = (a1, 0, 0) + (0, a2, 0) + (0, 0, a3) 


= (ai, a2, a3). 
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Thus, we can express every vector as a sum of scalar multiples of i, j, and k. 


The Standard Basis Vectors 


1. The vectors i, j, and k are unit vectors along the three coordinate axes, as 
shown in Figure 1.1.14. 


2. Ifa has components (a), a2, a3), then 


a=ai+aj+a3k. 


[SN Ue} Express the vector whose components are (e, zr, —4/3) in the stan- 
dard basis. 


SOLUTION Substituting a; = e, a? = 7, and a3 = 43 into a = aji + aj+ 
a3k gives 


v=ei+nj — 3k. A 


—jt4k A 


Figure 1.1.15 Representation of (2, 3, 2) in terms 
of the standard basis vectors i, j, and k. 


Addition and scalar multiplication may be written in terms of the standard basis 
vectors as follows: 


(aii + aj + a3 K) + (bii + bj + b3K) = (a1 + bi) + (a2 + b2)j + (a3 + b3)k 
and 


o (aj + aj + a3k) = («a)i + (6a2)j + o (a3)k. 
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The Vector Joining Two Points 
To apply vectors to geometric problems, it is useful to assign a vector to a pair of 
points in the plane or in space, as follows. Given two points P and P’, we can draw 
—, 
the vector v with tail P and head P’, as in Figure 1.1.16, where we write PP’ for v. 


PP’ à 
P 
Figure 1.1.16 The vector from P to P' is denoted PP'. 


If P = (x, y, z) and P' = (x', y’, z^), then the vectors from the origin to P and P’ 
— 
are a = xi + yj + zk and a’ = x'i + y'j +2’k, respectively, so the vector PP’ is the 
difference a’ — a = (x' — x)i + (y' — y)j + (z' — z)k. (See Figure 1.1.17.) 


: pp’ — OB nb 
Figure 1.1.17 PP’ = OP’ — OP. 


a'a p 
ae 
Aa 
r 


The Vector Joining Two Points If the point P has coordinates (x, y, z) and 


cm 
P’ has coordinates (x, y’, z^), then the vector PP’ from the tip of P to the tip of 
P' has components (x' — x, y' — y, z' — z). 


EXAMPLE 8 


(a) Find the components of the vector from (3, 5) to (4, 7). 
(b) Add the vector v from (—1, 0) to (2, —3) and the vector w from (2, 0) to (1, 1). 


(c) Multiply the vector v in (b) by 8. If the resulting vector is represented by the 
directed line segment from (5, 6) to Q, what is Q? 


SOLUTION 


(a) As in the preceding box, we subtract the ordered pairs: (4, 7) — (3, 5) = (1, 2). 
Thus the required components are (1, 2). 
(b) The vector v has components (2, —3) — (—1, 0) = (3, —3), and w has com- 


ponents (1, 1) — (2, 0) = (—1, 1). Therefore, the vector v 4- w has compo- 
nents (3, —3) + (—1, 1) = (2, —2). 
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(c) The vector 8v has components 8(3, —3) — (24, —24). If this vector is rep- 
resented by the directed line segment from (5, 6) to Q, and Q has coordi- 
nates (x, y), then (x, y) — (5, 6) = (24, —24), so (x, y) = (5, 6) + 24, —24) = 
(29, 18). a 


EWIDO) Let P= (-2, —1), Q = (—3, 3) and R = (—1, —4) in the xy 


plane. 


(a) Draw these vectors: v joining P to Q; w joining Q to R; u joining R to P. 
(b) What are the components of v, w, and u? 

(c) What is v 4- w+ u? 

SOLUTION 

(a) SeeFigure 1.1.18. 


Figure 1.1.18 The vector v joins P to Q; w joins Q to R; 
and u joins R to P. 


(b) Because v — PÓ, w — QR, and u = RP, we get 
v = (3, —3) - (-2, -1) = (-1, —2), 
w = (—1, —4) — (-3, -3) = (2, -1), 
uz —(—1, —4) 4- (-2, -1) « (-1, 3). 


() v+w+us(-1, —2)+(2,-1)+(-1,3)=(0,0). à 


Geometry Theorems by Vector Methods 


Many of the theorems of plane geometry can be proved by vector methods. Here is 
one example. 
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E 


M828 290} Use vectors to prove that the diagonals of a parallelogram bisect 
each other. 


SOLUTION Let OPRQ be the parallelogram, with two adjacent sides represented 


by the vectors a = OP and b = 00. Let M be the midpoint of the diagonal OR, N 
the midpoint of the other diagonal, PQ. (See Figure 1.1.19.) 


» R Figure 1.1.19 Ifthe midpoints M and N 
coincide, then the diagonals OR and PQ 
o bisect each other. 


Observe t that OR — oP + 00 = a + b by the parallelogram rule for vector ad- 
dition, so OM = 1OR = }(a + b). On the other hand, 


—-0Q-OP-b-a, so  PN-IPQ-i(b-a) 


and hence 


> > > 1 1 
ON = OP + PN = a + 1(b — a) = 1(a + b). 


Because OM and ON are equal vectors, the points M and N coincide, so the diagonals 
bisecteach other. A 


Equations of Lines 


Planes and lines are geometric objects that can be represented by equations. We shall 
defer until Section 1.3 a study of equations representing planes. However, using the 
geometric interpretation of vector addition and scalar multiplication, we will now 
find the equation of a line | that passes through the endpoint of the vector a, with the 
direction of a vector v (see Figure 1.1.20). 

As t varies through all real values, the points of the form tv are all scalar 
multiples of the vector v, and therefore exhaust the points of the line passing through 
the origin in the direction of v. Because every point on/ is the endpoint ofthe diagonal 
of a parallelogram with sides a and tv for some real value of t, we see that all the 
points on / are of the form a + tv. Thus, the line / may be expressed by the equation 
I(t) = a + tv. We say that / is expressed parametrically, with t the parameter. At 
t = 0, I(t) = a. Ast increases, the point I(t) moves away from a in the direction of v. 
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Figure 1.1.20 The line /, parametrically 
given by I(t) = a + tv, lies in the direction 
v and passes through the tip of a. 


As t decreases from ¢ = 0 through negative values, I(t) moves away from a in the 
direction of —v. 


Point-Direction Form ofa Line The equation ofthe line / through the tip of a 
and pointing in the direction of the vector v is I(t) = a + tv, where the parameter 
t takes on all real values. In coordinate form, the equations are 

x= x +at, 

y-cy tbt, 

z= tet, 


where a = (xi, y1, zi) and v = (a, b, c). For lines in the xy plane, one simply 
drops the z component. 


|. TWIG REN Determine the equation of the line / passing through (1, 0, 0) in 
the direction j. See Figure 1.1.21. 


Figure 1.1.21 The line / passes 
through the tip of i in the direction j. 
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SOLUTION The desired line can be expressed parametrically as I(t) = i+ tj. In 
terms of coordinates, 


Yr) = (1, 0,0) +40, 1,0) =(1, 1,0). A 


EXAMPLE 12 


(a) Find the equations of the line in space through the point (3, —1, 2) in the direction 
2i — 3j + 4k. 


(b) Findtheequation ofthe line in the plane through the point (1, —6) in the direction 
of 5i — xj. 


(c) In what direction does the line x = —3t + 2, y = —2(t — 1), z = 8t + 2 point? 
SOLUTION 


(a) Herea = (3, —1, 2) = (xi, yi, zi)and v = 2i — 3j + 4k, soa = 2, b = —3, and 
c — 4. From the box above, the equations are 


x — 34 2t, y2-1-3t, z —2- 4t. 


(b) Here a = (1, —6) and v = 5i — 7 j, so the required line is 


Kt) = (1, —6) + (St, —1) = (1 + 5t, —6 — zt); 
that is, 
x=1+5t, y= -6 — nt. 


(c) Using the preceding box, we construct the direction v = ai + bj + ck from the 
coefficients of t: a — —3, b — —2, c — 8. Thus, the line points in the direction 
of v= —3i—2j+8k. A 


AGBE] Do the two lines (x, y, z) = (t, —6t + 1, 2t —8) and (x, y, z) = 
(3t + 1, 2t, 0) intersect? 


SOLUTION Ifthe lines intersect, there must be numbers 1; and t such that the 
corresponding points are equal: 


(4, —6h + 1,2 — 8) = Gto + 1,265, 0); 
that is, all three of the following equations hold: 


tj — 354 1, 
—6t1 +1 = 26, 
2t, —8-— 0. 
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From the third equation, t; = 4. The first equation then becomes 4 = 3f + 1; that is, 
t; = 1. We must check whether these values satisfy the middle equation: 


—6t 4-15 26. 
Since t; = 4 and f = 1, this reads 
i 
—244+1=2, 
which is false, so the lines do not intersect. A 


Notice that there can be many equations of the same line. Some may be obtained 
by choosing instead of a, a different point on the given line, and forming the parametric 
equation of the line beginning at that point and in the direction of v. For example, the 
endpoint ofa + vison the line I(t) = a + tv, and thus, lı (t) = (a + v) + tv represents 
the same line. Still other equations may be obtained by observing that if w 4 0, the 
vector av has the same (or opposite) direction as v. Thus, z(t) = a + tæv is another 
equation of I(t) = a + tv. 

For example, both I(t) = (1, 0, 0) + (t, £, 0) and lı(s) = (0, —1, 0) + (s, s, 0) 
represent the same line since both are in the direction i 4- j and both pass through 
the point (1, 0, 0); I passes through (1, 0, 0) at  — 0 and I; passes through (1, 0, 0) at 
ge 

Therefore, the equation of a line is not uniquely determined. Nevertheless, it 
is customary to use the term "the" equation of a line. Keeping this in mind, let us 
derive the equation of a line passing through the endpoints of two given vectors a 
and b. Because the vector b — a is parallel to the directed line segment from a to b, 
we calculate the parametric equation of the line passing through a in the direction of 
b — a (Figure 1.1.22). Thus, 


Kr) 2a--t(b—a)y — thatis, — Kt) 2 (1 — t)a4- tb. 


Figure 1.1.22 The line /, parametrically given 
by I(t) = a + t(b — a) = (1 — t)a + tb, passes 
through the tips of a and b. 


t(b—a) 


As t increases from 0 to 1, ¢(b — a) starts as the zero vector and increases in length 
(remaining in the direction of b — a) until at ¢ = 1 it is the vector b — a. Thus, for 
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Kt) = a + t(b — a), ast increases from 0 to 1, the vector I(t) moves from the endpoint 
of a to the endpoint of b along the directed line segment from a to b. 

If P = (xi, yi, zi) is the tip of a and Q = (x2, y2, z2) is the tip of b, then v = 
(x2 — xı )i + (yo — yi)i + (Z2 — zi)K, and so the equations of the line are 


x —xjpT (xo— xyt, 

y =y + (z -yyt. 

z =Z; + (z»-zyt. 
By eliminating ż, these can be written as 


EE. 20:30-04:22] 


X2—Xi -A Zz2—2| 


Parametric Equation of a Line: Point-Point Form The parametric equa- 
tions of the line / through the points P = (xi, yi, zi) and Q = (x2, y2, z2) are 


x = xi t -x)t, 
y =) + (22 —y1)t, 
z-—z-4(z-z)t, 


where (x, y, z) is the general point of Z, and the parameter ¢ takes on all real 
values. 


Find the equation of the line through (2, 1, —3) and (6, —1, —5). 
SOLUTION Using the preceding box, we choose (xi, yi, zi) — (2, 1, —3) and 
(X2, y2, z2) = (6, —1, —5), so the equations are 

x 224 (6—2) —2 4 4t, 
yd): i— 2f, 
j=- (5 = (Dee 3-25, ow 


PWIA R MEY Find the equation of the line passing through (—1, 1, 0) and 
(0, 0, 1) (see Figure 1.1.23). 


SOLUTION Letting a = —i + j and b = k represent the given points, we have 


Yr) =(1—#(-14 p+ tk = —( — ti + (1 — 0j + tk. 
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(0,0,1) 


Figure 1.1.23 Finding the 
equation of the line through 
two points. 


The equation of this line may thus be written as 
Kr) 2 (t — Di 4- (1 — 0j + tk, 


or, equivalently, if I(t) = xi + yj + zk, 


x-—t—l, y=l-t, z-—4 á 


The description of a line segment requires that the domain of the parameter t be 
restricted, as in the following example. 


DGIB Find the equation of the line segment between (1, 1, 1) and 
(2,752). 


SOLUTION The line through (1, 1, 1) and (2, 1, 2) is described in parametric 
form by (x, y, z) =(1 o t, 1, 1+ £), as t takes on all real values. When t = 0, the 
point (x, y, z) is (1, 1, 1), and when ¢ = 1, the point (x, y, z) is (2, 1, 2). Thus, the 
point (x, y, z) lies between (1, 1, 1) and (2, 1, 2) when 0 < ¢ < 1, so the line segment 
is described by the equations 


x=] +$ 
yz-l 
z=1+t, 


together with the inequalities 0 < £ < 1. A 
We can also give parametric descriptions of geometric objects other than lines. 


JO CWIIUEDENE Describe the points that lie within the parallelogram whose ad- 
jacent sides are the vectors a and b based at the origin (“within” includes points on 
the edges of the parallelogram). 
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SOLUTION Consider Figure 1.1.24. If P is any point within the given parallel- 
ogram and we construct lines /; and /; through P parallel to the vectors a and b, 
respectively, we see that /; intersects the side of the parallelogram determined by the 
vector b at some point tb, where 0 < t < 1. Likewise, l intersects the side determined 
by the vector a at some point sa, where 0 < s < 1. 


b P 
a 4 h Figure 1.1.24 Describing points within the 
parallelogram formed by vectors a and b, with vertex 0. 
lai 
b 


Note that P is the endpoint of the diagonal of a parallelogram having adjacent 
sides sa and tb; hence, if v denotes the vector OP, we see that v = sa + tb. We 
conclude that all the points in the given parallelogram are endpoints of vectors of 
the form sa + tb for 0 € s < 1 and 0 < t < 1. Reversing our steps, we see that all 
vectors of this form end within the parallelogram. A 


As two different lines through the origin determine a plane through the origin, 
so do two nonparallel vectors. If we apply the same reasoning as in Example 17, we 
see that the entire plane formed by two nonparallel vectors v and w consists of all 
points of the form sv + tw where s and t can be any real numbers, as in Figure 1.1.25. 


Figure 1.1.25 Describing points P in 
the plane formed from vectors v and w. 


We have thus described the points in the plane by two parameters. For this reason, 
we say the plane is two-dimensional. Similarly, a line is called one-dimensional 
whether it lies in the plane or in space or is the real number line itself. 
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The plane determined by v and w is called the plane spanned by v and w. When v 
is a scalar multiple of w and w Æ 0, then v and ware parallel and the plane degenerates 
toa straight line. When v — w — 0 (that is, both are zero vectors), we obtain a single 


point. 


There are three particular planes that arise naturally in a coordinate system and 
that will be useful to us later. We call the plane spanned by vectors i and j the xy 
plane, the plane spanned by j and k the yz plane, and the plane spanned by i and k 


the xz plane. These planes are illustrated in Figure 1.1.26. 


z 


yz plane 


xz plane 


xy plane 


EXERCISES 
(Exercises with colored numbers are solved in the Study Guide.) 


Complete the computations in Exercises 1 to 4. 
1. (—21, 23) — (?, 6) = (—25,?) 
2. 3(133, —0.33, 0) + (—399, 0.99, 0) = (?, ?, ?) 
3. (8a, —2b, 13c) = (52, 12, 11) + 202, 2, ? 


4. (2,3,5) — 4i - 33 2 (2,2,? 


Figure 1.1.26 The three coordinate planes. 


In Exercises 5 to 8, sketch the given vectors v and w. On your sketch, draw in —v, v +w, and 


y —w. 

5. v= Q, 1) and w = (1,2) 

6. v— (0,4) and w = (2, 1) 

7. v= (2,3, —6) and w = (—1, 1, 1) 


8. v= (2, 1, 3) and w = (—2, 0, -1) 
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9. What restrictions must be made on x, y, and z so that the triple (x, y, z) will represent a 
point on the y axis? On the z axis? In the xz plane? In the yz plane? 


10. (a) Generalize the geometric construction in Figure 1.1.7 to show that if v; = (x, y, z) 
and v2 = (x^, y', z^), then vj + v2 = (x + x, y + yz +7’). 

(b) Using an argument based on similar triangles, prove that ev = (ox, wy, wz) when 
v —(x,y,z). 


In Exercises 11 to 17, use set theoretic or vector notation or both to describe the points that 
lie in the given configurations. 


11. The plane spanned by v; — (2, 7, 0) and v; — (0,2, 7) 

12. The plane spanned by v, = (3, —1, 1) and v? = (0,3, 4) 

13. The line passing through (—1, —1, —1) in the direction of j 

14. The line passing through (0, 2, 1) in the direction of 2i — k 

15. The line passing through (—1, —1, —1) and (1, —1,2) 

16. The line passing through (—5, 0, 4) and (6, —3, 2) 

17. The parallelogram whose adjacent sides are the vectors i + 3k and —2j 


18. Find the points of intersection of the line x = 3 + 2t, y = 7 + 8t, z = —2 + t, that is, 
I(t) = (3 + 2t, 7 + 8t, —2 + t), with the coordinate planes. 


19. Show that there are no points (x, y, z) satisfying 2x — 3y + z — 2 = 0 and lying on the 
line v = (2, 22, —1) + ¢(1, 1, 1). 


20. Show that every point on the line v = (1, —1, 2) + ¢(2, 3, 1) satisfies the equation 
5x —-3y -z— 6 — 0. 


21. Determine whether the lines x = 3t 4-2, y = t — 1,z = 6t + l, and x = 3s — 1, 
y =s—2,z=s intersect. 


22. Do the lines (x, y, z) = (t + 4, 4t + 5, t — 2) and (x, y, z) = Qs + 3,s +1, 2s — 3) 
intersect? 


In Exercises 23 to 25, use vector methods to describe the given configurations. 
23. The parallelepiped with edges the vectors a, b, and c emanating from the origin. 


24. The points within the parallelogram with one corner at (xo, yo, zo) Whose sides extending 
from that corner are equal in magnitude and direction to vectors a and b. 


25. The plane determined by the three points (xo, yo, zo), (x1, Yı, zi), and (x2, 2, z2). 
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Prove the statements in Exercises 26 to 28. 


26. The line segment joining the midpoints of two sides of a triangle is parallel to and has 
half the length of the third side. 


27. If PQR is a triangle in space and b > 0 is a number, then there is a triangle with sides 
parallel to those of PQR and side lengths b times those of PQR. 


28. The medians ofa triangle intersect at a point, and this point divides each median in a 
ratio of 2:1. 


Problems 29 and 30 require some knowledge of chemical notation. 


29. Write the chemical equation CO + HO = H3 + CO» as an equation in ordered triples 
(x1, X2, x3) where xi, x2, x3 are the number of carbon, hydrogen, and oxygen atoms, 
respectively, in each molecule. 


30. (a) Write the chemical equation pC3H405 + qO» = rCO2 + sH20 as an equation in 
ordered triples with unknown coefficients p, q, r, and s. 

(b) Find the smallest positive integer solution for p, q,r, and s. 

(c) Illustrate the solution by a vector diagram in space. 


31. Find a line that lies entirely in the set defined by the equation x? + y? — z? = 1. 


1.2 The Inner Product, Length, and Distance 


In this section and the next we shall discuss two products of vectors: the inner prod- 
uct and the cross product. These are very useful in physical applications and have 
interesting geometric interpretations. The first product we shall consider is called the 
inner product. The name dot product is often used instead. 


'The Inner Product 


Suppose we have two vectors a and b in R? (Figure 1.2.1) and we wish to determine 
the angle between them, that is, the smaller angle subtended by a and b in the plane 


Figure 1.2.1 0 is the angle between the vectors a and b. 
bi: 


— y 


24 


The Geometry of Euclidean Space 


that they span. The inner product enables us to do this. Let us first develop the concept 
formally and then prove that this product does what we claim. Leta = aji + a?j + a3k 
and b = bii + bj + bsk. We define the inner product of a and b, written a - b, to be 
the real number 


a:b—ajbi + a2b; + a3b3. 


Note that the inner product of two vectors is a scalar quantity. Sometimes the inner 
product is denoted (a, b); thus, (a, b) and a+b mean exactly the same thing. 


(a) Ifa=3i+j—2kandb=i—j+k, calculate a - b. 
(b) Calculate (2i + j — k) - (3k — 2j). 

SOLUTION 

(à) a-b=3-1+1-(-1I)+(2)-1=3-1-2=0. 


(b Qi-cj-—k):Gk — 2j) = Qi- j — k) - (0i — 2j + 3k) 
z2.0—1:2—1-.32—5. A 


Certain properties of the inner product follow from the definition. If a, b, and c 
are vectors in R? and q and f are real numbers, then 


(i) a-a>0; 
a-a=0 if and only if a=0. 
(ii) wa-b = o(a- b) and a- Bb = f(a- b). 
(iii) a-(b+c)=a-b+a-e and (a+b)-c=a-c+b-e. 
(iv) a-b=b-a. 


To prove the first of these properties, observe that if a = aji + aj + a3k, then 
a-a — a +a} +a}. Because aj, a2, and a3 are real numbers, we know a? > 0, 
a > 0, a3 > 0. Thus, a-a > 0. Moreover, if a : a +a? 0, then a; = a? 
a3 = 0; therefore, a = 0 (zero vector). The proofs of the other properties of the inner 
product are also easily obtained. 

It follows from the Pythagorean theorem that the length of the vector a = aji + 
aj + ask is ya? + a2 + a? (see Figure 1.2.2). The length of the vector a is denoted 
by |[a||. This quantity is often called the norm of a. Because a- a = a + a + a, it 
follows that 


all = (a a)'?. 
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Va + a + a 


Figure 1.2.2 The length of the vector a = (a1, a2, a3) 
is given by the Pythagorean formula: ya? + a2 + a5. 


Unit Vectors 


Vectors with norm 1 are called unit vectors. For example, the vectors i, j, k are unit 
vectors. Observe that for any nonzero vector a, a/||al| is a unit vector; when we divide 
a by |lall, we say that we have normalized a. 


EXAMPLE 2| 


(a) Normalize v = 2i + 3j — 3k. 

(b) Find unit vectors a, b, and c in the plane such that b + c = a. 

SOLUTION 

(a) We have ||v|| = /2? + 3? + (1/2)? = (1/2)453, so the normalization of v is 


bg Megs Og, 41. 
lvl ^ V3 vs a 


(b) Because all three vectors are to have length 1, a triangle with sides a, b, and c 
must be equilateral, as in Figure 1.2.3. Orienting the triangle as in the figure, we 
take a — i, then necessarily 


1 43, 1, 43 


b= tah and Serpens d: 


Note that indeed ||al = ||b|| = ||cl| = 1 and thatb +c =a. a 


Figure 1.2.3 The vectors a, b, and c are represented by the sides of an 
equilateral triangle. 
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cos 0 
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In the plane, define the vector ig = (cos @)i + (sin 0)j, which is the unit vector 
making an angle 0 with the x axis (see Figure 1.2.4). 


Figure 1.2.4 The coordinates of ig are cos 0 and sin 6; it is a unit 
vector because cos? 0 + sin? 0 = 1. 


E4 


If a and b are vectors, we have seen that the vector b — a is parallel to and has the 
same magnitude as the directed line segment from the endpoint of a to the endpoint 
of b. It follows that the distance from the endpoint of a to the endpoint of b is ||b — al 
(see Figure 1.2.5). 


Inner Product, Length, and Distance Letting a = a,i+ aj +a3k and 


b = bii + boj + bsk, their inner product is 
a-b = aibi + aobo + a3b3, 

while the length of a is 
lal = Vara = Va? + a +a. 
To normalize a vector a, form the vector 

a 
Tal’ 
The distance between the endpoints of a and b is ||a — b||, and the distance 
between P and Q is ||PQ||. 
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1.7099 Find the distance from the endpoint of the vector i, that is, the 
point (1, 0, 0), to the endpoint of the vector j, that is, the point (0, 1, 0). 


SOLUTION |j—il = (0—1X 4-(1—0? +00? — 2. A 


The Angle Between Two Vectors 


Let us now show that the inner product does indeed measure the angle between two 
vectors. 


THEOREM 1 Leta and b be two vectors in R? and let 9, where 0 < 0 < x, 
be the angle between them (Figure 1.2.6). Then 


a-b = |lal|||b|| cos 8. 


It follows from the equation a - b = ||a||||b|| cos that if a and b are nonzero, we 
may express the angle between them as 


a-b 
a= eo =). 
llall |b] 


Figure 1.2.6 The vectors a, b, and the angle 0 between them; 
the geometry for Theorem 1 and its proof. 


PROOF If we apply the law of cosines from trigonometry to the triangle with one 
vertex at the origin and adjacent sides determined by the vectors a and b (as in the 
figure), it follows that 


Ib — all = ljal? + Ibl? — 2llalllbi| cose. 
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Because ||b — all? = (b — a)- (b — a), lal? = a - a, and ||b||? = b- b, we can rewrite 
the above equation as 


(b — a)-(b—a) = a-a+b-b— 2l[alll|bl| cos0. 


We can also expand (b — a) - (b — a) as follows: 


(b—a)-(b—a)—b-(b—a)—a-(b—a) 
=b-b—b-a—a-b+a-a 
=a-a+b-b—2a-b. 


Thus, 
a-a+b-b—2a-b=a-a+b-b— 2|al|||bl] cos. 
That is, 


a-b = |jallllb]| cos6. m 


DGVB] Find the angle between the vectors i+j+k and i+j—k 
(see Figure 1.2.7). 


Figure 1.2.7 Finding the angle between a = i + j +k 
andb —i-dj-—k. 


SOLUTION Using Theorem 1, we have 
Gi+j+k)-(G+j-—k = lli - j + Klilli 4- j — kl cos, 
and so 


1+ 1-— 1 = (V3)(3) cos. 
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Hence, 
cos = 1. 
That is, 


0- cos (i) 7 1.23 radians (71^). A 


'The Cauchy-Schwarz Inequality 


Theorem 1 shows that the inner product of two vectors is the product of their lengths 
times the cosine of the angle between them. This relationship is often of value in 
problems of a geometric nature. An important consequence of Theorem 1 is: 


COROLLARY: Cauchy-Schwarz Inequality For any two vectors a and b, 
we have 


la-b| < llalllibil 


with equality if and only if a is a scalar multiple of b, or one of them is 0. 


PROOF Ifa is not a scalar multiple of b, then 6, the angle between them, is not 
Zero or zt, and so |cos6| < 1, and thus the inequality holds; in fact, if a and b are 
both nonzero, strict inequality holds in this case. When a is a scalar multiple of b, 
then 6 = 0 or z and |cos0| = 1, so equality holds in this case. m 


Wide Verify the Cauchy-Schwarz inequality for a = —i + j +k and 
b=3i+k. 


SOLUTION The dot product is a-b — —3--04-1— —2, so |a-b| 2 2. 
Also, lal = /14- 12-1 — 4/3 and ||bl| 2 4/9 +T = V10, and it is true that 
2 < J/3- V10 because 4/3 - VIO > V3- 4/3 23 >2. A 


If a and b are nonzero vectors in IR? and 6 is the angle between them, we see 
that a- b = 0 if and only if cos@ = 0. Thus, the inner product of two nonzero vec- 
tors is zero if and only if the vectors are perpendicular. Hence, the inner product 
provides us with a convenient method for determining whether two vectors are per- 
pendicular. Often we say that perpendicular vectors are orthogonal. The standard basis 
vectors i, j, and k are mutually orthogonal and of length 1; any such system is called 
orthonormal. We shall adopt the convention that the zero vector is orthogonal to 
all vectors. 
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FSSSVISEN The vectors ip = (cos 6)i + (sin0)j and jp = —(sin6)i + (cos)j 


are orthogonal, because 


ig - jo = —cos@ sin@ + sinó cos6 = 0 


(see Figure 1.2.8). A 


Figure 1.2.8 The vectors ig and jọ are orthogonal and of unit 
length, that is, they are orthonormal. 


DGIN Let a and b be two nonzero orthogonal vectors. If c is a vector in 
the plane spanned by a and b, then there are scalars œ and f such that c = wa + fb. 
Use the inner product to determine o and £ (see Figure 1.2.9). 


z 


Figure 1.2.9 The geometry for finding œ 
and £, where c = oa + Bb. 


llel| cos 0= m 


SOLUTION Taking the inner product of a and c, we have 


a-c=a-(aa+ Bb) =aa-a+ Ba-b. 
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Because a and b are orthogonal, a - b = 0, and so 


a-c a-c 
"LITE 
Similarly, 
b.c b.c 
P= iy pee ^ 


Orthogonal Projection 


In the preceding example, the vector wa is called the projection of c along a, and Bb 
is its projection along b. Let us formulate this idea more generally. If v is a vector, 
and / is the line through the origin in the direction of a vector a, then the orthogonal 
projection of v on a is the vector p whose tip is obtained by dropping a perpendicular 
line to / from the tip of v, as in Figure 1.2.10. 


Figure 1.2.10 p is the orthogonal projection of v on a. 


Referring to the figure, we see that p is a multiple of a and that v is the sum of p 
and a vector q perpendicular to a. Thus, 


v=ca+q, 


where p = caanda - q = 0. Taking the dot product ofa with both sides of v = ca + q, 
we find a- v = ca-a, soc = (a - v)/(a-a), and hence 


E» a-v " 
P= Tae» 
The length of p is 
la-v| la - v| 
lipii = lali = = ||vI| cos6. 
llall? llall 


Orthogonal Projection The orthogonal projection of v on a is the vector 


ay 
p-—-.—7a. 
lall? 
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DGIB E] Find the orthogonal projection of i+ j on i — 2j. 


SOLUTION With a =i — 2j and v = i + j, the orthogonal projection of v on a 
is 


a-v 1—2 1 
_ i-20- l6-2i 
aca *= 1 q40- 20-50-20 


(see Figure 1.2.11). A 


Orthogonal Figure 1.2.11 The orthogonal projection of v on a equals —ta. 
projection 


of von a 


The Triangle Inequality 


A useful consequence of the Cauchy-Schwarz inequality, which is called the triangle 
inequality, relates the lengths of vectors a and b and of their sum a + b. Geometrically, 
the triangle inequality says that the length of any side of a triangle is no greater than 
the sum of the lengths of the other two sides (see Figure 1.2.12). 


Figure 1.2.12 This geometry shows that 
a IIOQII < }OR|| + IIRQII or, in vector notation, that 
la + bl| < |lal] + l|b]], which is the triangle inequality. 


THEOREM 2: Triangle Inequality For vectors a and b in space, 


la + bl] < llall + libil- 
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PROOF While this result may be clear geometrically, it is useful to give a proof 
using the Cauchy-Schwarz inequality, as it will generalize to n-dimensional vectors. 
We consider the square of the left-hand side: 


lla + bl? = (a + b) - (a + b) = lal? + 2a - b + [bl 
By the Cauchy-Schwarz inequality, we have 
lal? + 2a- b + IIb]? < lal? + 2llalllibl] + IIb I? = (llall + Ibl? 
Thus, 
lla + bl < (llall + Ibl; 


taking square roots proves the result. m 


EXAMPLE 9 
(a) Verify the triangle inequality for a = i+ j and b = 2i + j + k. 


(b) Prove that ||u — v|| < ||u — w|| + ||w — vl| for any vectors u, v, and w. Illustrate 
with a figure in which u, v, and w have the same base point. 


SOLUTION 


(a) We have at+b=3i+2j+k, so la+bl| 2 4/9 3-4 4- 1 = v14. On the 
other hand, |lal| = 4/2 and ||b|| = 4/6, so the triangle inequality asserts that 
J/14 Es /2 + /6. The numbers bear this out: 4/14 ~ 3.74, while J24+JS6% 
1.41 + 2.45 = 3.86. 


(b) We find that u — v = (u — w) + (w — v), so the result follows from the triangle 
inequality with a replaced by u — w and b replaced by w — v. Geometrically, 
we are considering the shaded triangle in Figure 1.2.13. A 


Figure 1.2.13 Illustrating the inequality 
lu — vl < lu — wll + lw — vll. 


Physical Applications of Vectors 


A simple example of a physical quantity represented by a vector is a displacement. 
Suppose that, on a part of the earth's surface small enough to be considered flat, we 
introduce coordinates so that the x axis points east, the y axis points north, and the 
unit of length is the kilometer. If we are at a point P and wish to get to a point Q, 
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E 
the displacement vector d — PQ joining P to Q tells us the direction and distance we 
have to travel. If x and y are the components of this vector, the displacement of P to 
Q is *x kilometers east, y kilometers north." 


| aB Suppose that two navigators who cannot see one another but can 
communicate by radio wish to determine the relative position of their ships. Explain 
how they can do this if they can each determine their displacement vector to the same 
lighthouse. 


SOLUTION Let P, and P, be the positions of the ships, and let Q be the position 
of the lighthouse. The displacement of the lighthouse from the ith ship is the vector 
d; joining P; to Q. The displacement of the second ship from the first is the vector d 
joining P, to P2. We have d + d; = d (Figure 1.2.14), andsod = dı — d». Thatis, the 
displacement from one ship to the other is the difference between the displacements 
from the ships to the lighthouse. A 


Figure 1.2.14 Vector methods can be used to locate objects. 


We can also represent the velocity of a moving object as a vector. For the moment, 
we will consider only objects moving at uniform speed along straight lines. Suppose, 
for example, that a boat is steaming across a lake at 10 kilometers per hour (km/h) in 
the northeast direction. After 1 hour of travel, the displacement is (10/ 42, 10/4/2) ~ 
(7.07, 7.07); see Figure 1.2.15. 


Position after | h 


10 Figure 1.2.15 If an object moves northeast at 10 km/h, its velocity 
2 vector has components (10/42, 10/4/2) = 10(1/4/2, 1/4/2), where 
(1/42, 1/4/2) is the unit vector of the northeast direction. 


Initial 10 
position 2 


The vector whose components are (10/4/2, 10/4/2) is called the velocity vec- 
tor of the boat. In general, if an object is moving uniformly along a straight line, 
its velocity vector is the displacement vector from the position at any moment to 
the position | unit of time later. If a current appears on the lake, moving due 
eastward at 2 km/h, and the boat continues to point in the same direction with its 
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engine running at the same rate, its displacement after 1 hour will have components 
given by (10/4/2 + 2, 10/4/2); see Figure 1.2.16. The new velocity vector, therefore, 
has components (10/4/2 + 2, 10/4/2). We note that this is the sum of the original 
velocity vector (10/4/2, 10/4/2) of the boat and the velocity vector (2, 0) of the 
current. 


Displacement due 

to current 
Displacement 
due to 
engine 


Figure 1.2.16 The total displacement is the sum of the 
displacements due to the engine and the current. 


Total displacement 


Displacement and Velocity If an object has a (constant) velocity vector v, 
then in ¢ units of time the resulting displacement vector of the object is d = tv; 


thus, after time ż = 1, the displacement vector equals the velocity vector. See 
Figure 1.2.17. 


Figure 1.2.17 Displacement — time x velocity. 


Displacement in time t 


GVA A birdis flying ina straight line with velocity vector 10i + 6j +k 
(in kilometers per hour). Suppose that (x, y) are its coordinates on the ground and z 
is its height above the ground. 


(a) Ifthe bird is at position (1, 2, 3) at a certain moment, what is its location 1 hour 
later? 1 minute later? 


(b) How many seconds does it take the bird to climb 10 meters? 


SOLUTION (a) The displacement vector from (1, 2, 3) after 1 hour is given by 
10i + 6j + k, so the new position is (1, 2, 3) + (10, 6, 1) = (11, 8, 4). After 1 minute, 
the displacement vector from (1, 2, 3) is d; (10i + 6j + k) = £i + 4j+ Gk, and so 
the new position is (1, 2, 3) + G. > a) = G. z, m i 

(b) After t seconds (= 1/3600 hours), the displacement vector from (1, 2, 3) is 
(1/3600) (10i + 6j + k) = (t/360)i + (¢/600)j + (¢/3600)k. The increase in altitude 
is the z component, namely, 7/3600. This will equal 10 m (= 15 km) when ¢/3600 = 


1 ; = 
joo: that is, when? = 36 s. A 
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Physical forces have magnitude and direction and may thus be 
represented by vectors. If several forces act at once on an object, the resultant force 
is represented by the sum of the individual force vectors. Suppose that forces i+ k 
and j + k are acting on a body. What third force F must we impose to counteract the 
two—that is, to make the total force equal to zero? 


SOLUTION The force F should be chosen so that (i+ k) + (j + k) + F = 0; 
that is, F = —(i+ k) — (j + k) = —i — j — 2k. (Recall that 0 is the zero vector, the 
vector whose components are all zero.) A 


EXERCISES 
1. Calculate (3i + 2j + k) - (i+ 2j — k). 


2. Calculate a- b where a = 2i + 10j — 12k and b = —3i+ 4k. 
3. Find the angle between 7j + 19k and —2i — j (to the nearest degree). 
4. Compute u - v, where u = /3i — 315j + 22k and v = u/|lull. 
5. Is ||8i — 12k|| - ||6j + k|| — |(8i — 12k) - (6j + k)| equal to zero? Explain. 
In Exercises 6 to 11, compute ||u||, ||v||, and u - v for the given vectors in R2. 
6. u= 15i 2j + 4k, v = mi + 3j — k 
7. u=2j—i,v=-j+i 
8. u=S5i—j+2k,v=i+j—k 
9. u— —i 3j - Kk v = —2i — 3j — 7k 
10. u = —i + 3k, v = 4j 
11. u = —i + 2j — 3k, v = —i — 3j + 4k 


12. Normalize the vectors in Exercises 6 to 8. (Only the solution corresponding to Exercise 7 
is in the Student Guide.) 


13. Find the angle between the vectors in Exercises 9 to 11. If necessary, express your 
answer in terms of cos~!. 


14. Find the projection of u = —i + j + k onto v = 2i + j — 3k. 
15. Find the projection of v = 2i + j — 3k onto u = —i + j +k. 


16. What restrictions must be made on the scalar b so that the vector 2i + bj is orthogonal to 
(a) —3i + 2j + k and (b) k? 


17. Find two nonparallel vectors both orthogonal to (1, 1, 1). 
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18. Find the line through (3, 1, —2) that intersects and is perpendicular to the line 
x=-lt+t,y=—24+1t,z 1 + t. [HiNr: If (xo, yo, Zo) is the point of intersection, find 
its coordinates.] 


19. A ship at position (1, 0) on a nautical chart (with north in the positive y direction) sights 
a rock at position (2, 4). What is the vector joining the ship to the rock? What angle 0 does 
this vector make with due north? (This is called the bearing of the rock from the ship.) 


20. Suppose that the ship in Exercise 19 is pointing due north and traveling at a speed of 4 
knots relative to the water. There is a current flowing due east at 1 knot. The units on the chart 
are nautical miles; 1 knot = 1 nautical mile per hour. 


(a) Ifthere were no current, what vector u would represent the velocity of the ship 
relative to the sea bottom? 

(b) Ifthe ship were just drifting with the current, what vector v would represent its 
velocity relative to the sea bottom? 

(c) What vector w represents the total velocity of the ship? 

(d) Where would the ship be after 1 hour? 

(e) Should the captain change course? 

(f) What if the rock were an iceberg? 


21. Anairplane is located at position (3, 4, 5) at noon and traveling with velocity 
400i + 500j — k kilometers per hour. The pilot spots an airport at position (23, 29, 0). 


(a) At what time will the plane pass directly over the airport? (Assume that the plane is 
flying over flat ground and that the vector k points straight up.) 
(b) How high above the airport will the plane be when it passes? 


22. The wind velocity v; is 40 miles per hour (mi/h) from east to west while an airplane 
travels with air speed v; of 100 mi/h due north. The speed of the airplane relative to the 
ground is the vector sum v; + v2. 


(a) Find v; + v2. 
(b) Draw a figure to scale. 


23. A force of 50 Ib is directed 50* above horizontal, pointing to the right. Determine its 
horizontal and vertical components. Display all results in a figure. 


24. Two persons pull horizontally on ropes attached to a post, the angle between the ropes 
being 60°. Person A pulls with a force of 150 Ib, while B pulls with a force of 110 Ib. 


(a) The resultant force is the vector sum of the two forces. Draw a figure to scale that 
graphically represents the three forces. 

(b) Using trigonometry, determine formulas for the vector components of the two forces 
in a conveniently chosen coordinate system. Perform the algebraic addition, and find the 
angle the resultant force makes with A. 


25. A 1-kilogram (1-kg) mass located at the origin is suspended by ropes attached to the two 
points (1, 1, 1) and (—1, —1, 1). If the force of gravity is pointing in the direction of the 
vector —k, what is the vector describing the force along each rope? [HINT: Use the symmetry 
of the problem. A 1-kg mass weighs 9.8 newtons (N).] 
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26. Suppose that an object moving in direction i + j is acted on by a force given by the 
vector 2i + j. Express this force as a sum of a force in the direction of motion and a force 
perpendicular to the direction of motion. 


27. A force of 6 N (newtons) makes an angle of 7/4 radian with the y axis, pointing to the 
right. The force acts against the movement of an object along the straight line connecting 
(1, 2) to (5, 4). 


(a) Find a formula for the force vector F. 

(b) Find the angle 0 between the displacement direction D = (5 — 1)i + (4 — 2)j and 
the force direction F. 

(c) The work done is F - D, or equivalently, ||F||||D|| cos @. Compute the work from both 
formulas and compare. 


1.3 Matrices, Determinants, and the Cross Product 


In Section 1.2 we defined a product of vectors that was a scalar. In this section we 
shall define a product of vectors that is a vector; that is, we shall show how, given two 
vectors a and b, we can produce a third vector a x b, called the cross product of a and 
b. This new vector will have the pleasing geometric property that it is perpendicular 
to the plane spanned (determined) by a and b. The definition of the cross product is 
based on the notions of the matrix and the determinant, and so these are developed 
first. Once this has been accomplished, we can study the geometric implications of 
the mathematical structure we have built. 


2 x 2 Matrices 


We define a 2 x 2 matrix to be an array 
au d 
d» an 
where a11, 412, 421, and an are four scalars. For example, 
2; 1 -1 0 d 13 7 
0 4r y yp = 6 11 
are 2 x 2 matrices. The determinant 


au a 
anı 422 


of such a matrix is the real number defined by the equation 


au dp 


= 41102) — 412021. (1) 
a» 022 
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EXAMPLE 1 


3 x 3 Matrices 


A 3 x 3 matrix is an array 


41 412 413 
42, an 02 
a31 d32 433 


where, again, each aj; is a scalar; a;; denotes the entry in the array that is in the ith 
row and the jth column. We define the determinant of a 3 x 3 matrix by the rule 


ai Gp di a» an 


a31 432 


421 423 


an 023 
2 +a 
31 3 


432 033 


; Q) 


421 an 425|-—dl 
a31 032 033 


Without some mnemonic device, formula (2) would be difficult to memorize. The rule 
to learn is that you move along the first row, multiplying a); by the determinant of 
the 2 x 2 matrix obtained by canceling out the first row and the jth column, and then 
you add these up, remembering to put a minus in front of the a;? term. For example, 
the determinant multiplied by the middle term of formula (2), namely, 


a2 023 
a31 433 


is obtained by crossing out the first row and the second column of the given 3 x 3 
matrix: 


EXAMPLE 2 


100 
10 jo ol jo 1 
0 1 0 T |-o] m E 
543^ qe p a p g 
123 
à 5 dei? Saal? Syst 32.3195. w 
ee 453-59 399 3 
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Properties of Determinants 


An important property of determinants is that interchanging two rows or two columns 
results in a change of sign. For 2 x 2 determinants, this is a consequence of the 
definition as follows: For rows, we have 


au a2 421 422 
= ananz — 421412 = —(421412 — 411422) 
aı 422 au an 
and for columns, 
au ad) ap an 
(anan — 411422) : . 
d» an an 2} 


We leave it to the reader to verify this property for the 3 x 3 case. 
A second fundamental property of determinants is that we can factor scalars out 
of any row or column. For 2 x 2 determinants, this means 


ea ia) ja Qaz jan 2) _ an Zaz | an Giz 
«d» an dj, Qan a3 an ai an «d, œan 
Similarly, for 3 x 3 determinants we have 
cay cay Ga a1 dp dg 41 cadi a3 
an a22 473) = & |d231 a2 23| = |ü21 C422 423|, 
431 d32 433 a31 432 033 431 &a32 433 


and so on. These results follow from the definitions. In particular, if any row or column 
consists of zeros, then the value of the determinant is zero. 

A third fundamental fact about determinants is the following: If we change a row 
(or column) by adding another row (or, respectively, column) to it, the value of the 
determinant remains the same. For the 2 x 2 case, this means that 


a dj. nb a-cb| | a a 
bi b bi b; b+aq, b+ 
aucta a EIL a, a2 
~ |b) +b: ba bj by + ba] 


1.3 Matrices, Determinants, and the Cross Product 41 


For the 3 x 3 case, this means 
a a a ath ath, @ +b aita a a 
bi b b|=| bi b; by |= |bi+b2 bj by, 


cy ey 065 €i [2] C3 C+ o C3 


and so on. Again, this property can be proved using the definition of the determinant. 


EXAMPLE 3 Bist eg 


a=ab+ fe; _ that is, a = (a1, a2, a3) = a(b1, b2, b3) + B(Ci, C2, c3). 


Show that 
aq, a a 
bj bj b3|—0. 
ĉi Cz C3 


SOLUTION We shall prove the case œ # 0, P # 0. The case œ = 0 = £ is trivial, 
and the case where exactly one of a, B is zero is a simple modification of the case we 
prove. Using the fundamental properties of determinants, the determinant in question 
is 
abı 4 Be, ab; c Be. abs + Bcs 
bi b; b 
[21 €2 cy 


1 abı + pci ab; + Beo. abs + Bes 
= ab; aby ab; 
& [4] c2 €3 
(factoring —1/o out of the second row) 


obi ab» abs 


Bey Bea Bcs 
(factoring —1/P out of the third row) 


( X 3 abi -- Bc; ab» Bc; abs + Bes 


1 | Ber pao Bo 


=—|-ab; —oe«b,; —ab; (adding the second row to the first row) 


op —Be, —Be -—fes 


0 0 0 
=—|-ab; —ab, —ab; (adding the third row to the first row) 


ap -Ba —Be, -—po 
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Closely related to these properties is the fact that we can expand a 3 x 3 determi- 
nant along any row or column using the signs in the following checkerboard pattern: 


For instance, the reader can check that we can expand *by minors" along the middle 
TOW: 


a1 dp dp» an 413 au 413 au an 
43 an 423) = —a5 + a22 — az 
a31 432 433 a32 03 d3 433 a31 432 


Let us redo the second determinant in Example 2 using this formula: 


| 2 3 2 3 1 3 1 2 
4 5 6=-4| |s| |-6  |-C4X-9-4 (X12) (-6(6) = 0. 
7 89 


1693, in connection with solutions of linear equations. Maclaurin and 
Cramer developed their properties between 1729 and 1750; in particular, 
they showed that the solution of the system of equations 


anxi + d12X2 + d13X3 = bi 
421X1 + 22X2 + 423X3 = b2 
431 %1 + a32x2 + a33x3 = b3 


is 


4 bi an ag 14 bi a Qe e bij | 
US b; a» a5.  x»— aa b? az, x3= ga a2 b; 
I 
b3 az az an bs a3 a3 az bs] | 
|| where | 


| ai, 412 443 
A= an 422 a3}, 
| a31 432 433 


a fact now known as Cramer’s rule. While this method is rather inefficient 
from a numerical point of view, it is of theoretical importance in matrix 
theory. Later, Vandermonde (1772) and Cauchy (1812), treating 
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determinants as a separate topic worthy of special attention, developed the 
field more systematically, with contributions by Laplace, Jacobi, and others. 
Formulas for volumes of parallelepipeds in terms of determinants are due to 
Lagrange (1775). We shall study these later in this section. Although during 
the nineteenth century mathematicians studied matrices and determinants, 
the subjects were considered separate. For the full history up to 1900, see 

T. Muir, The Theory of Determinants in the Historical Order of Development 
(reprinted by Dover, New York, 1960). 


Cross Products 


Now that we have established the necessary properties of determinants and discussed 
their history, we are ready to proceed with the cross product of vectors. 


DEFINITION: The Cross Product Suppose that a = aii + a?j + azk and 
b = bii + boj + bsk are vectors in R?. The cross product or vector product of 
a and b, denoted a x b, is defined to be the vector 


a2 d3|. a, 43). |a a 
asb— us c le el 
or, symbolically, 
i j k 
axb-ja a» aj. 
bi b bs 


Even though we only defined determinants for arrays of real numbers, this formal 
expression involving vectors is a useful memory aid for the cross product. 


Find (3i — j + k) x (i + 2j — k). 


SOLUTION 
i j k 
Gi-j+k)x@i+2j-k)=|3 -1 1)=-i+4j+7k 4 
1 2-1 


Certain algebraic properties of the cross product follow from the definition. If a, 
b, and c are vectors and a, B, and y are scalars, then 


(i) ax b = —(b x a) 


(ii) a x (Bb + yc) = B(a x b) + y(a x c) and (ea + fb) xe = 
a(a x €) 4- (b x c). 
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Note that a x a = —(a x a), by property (i). Thus, a x a = 0. In particular, 
ixi20, jxj=0, kxk=0. 

Also, 
ixj=k, pck=1, kxi=j, 


which can be remembered by cyclicly permuting i, j, k like this: 


To give a geometric interpretation of the cross product, we first introduce the 
triple product. Given three vectors a, b, and c, the real number 


(a x b)-c 


is called the triple product of a, b, and c (in that order). To obtain a formula for it, let 
a = aii + aj + azk, b = bii + b5j + b3k, and c = cii + cj + c3k. Then 


_ (|2 d. qm 43). a a Sis s 

(a x b)-c= k bi br b j+ "Es k) (cil + c2j + c3k) 
— {42 B|, ju a5 a a 
=le b^ -ib bi tb Bo] 


This is the expansion by minors of the third row of the determinant, so 


aj a2 a 
(axb)-e= |b; b bj. 
an G 


If c is a vector in the plane spanned by the vectors a and b, then the third row in the 
determinant expression for (a x b) - c is a linear combination of the first and second 
rows, and therefore (a x b)- c = 0. In other words, the vector a x b is orthogonal to 
any vector in the plane spanned by a and b, in particular to both a and b. 

Next, we calculate the length of a x b. Note that 


2 2 2 
a 3 a, a3 a a2 


b bs bi obs bi b 
(ab; — a3b2)? + (aib; — bias + (aib — bia}. 


la x b||? = 
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If we expand the terms in the last expression, we can recollect them to give 
(aj ^ 43 ^ a3)(b] -- b3 + b3) — (aibi + azb + asbsy, 
which equals 
lal? IIb]? — (a + b)? = lal? Ibl" — lla? IIb cos? 0 = Jal? II sin? 6 


where 6 is the angle between a and b, 0 < 0 < x. Taking square roots and using 
AI? = |k|, we find that ||a x b|| = llalllibll|sin6]. 

Combining our results, we conclude that a x b is a vector perpendicular to the 
plane P spanned by a and b with length ||a||||b|||sin0|. We see from Figure 1.3.1 that 
this length is also the area of the parallelogram (with base ||a]| and height ||b sin 0|) 
spanned by a and b. There are still two possible vectors that satisfy these conditions, 
because there are two choices of direction that are perpendicular (or normal) to P. 
This is clear from Figure 1.3.1, which shows the two choices n; and —n, perpendicular 
to P, with |[ni]| = ||-nil| = llalll|bIl|sin6]. 


Figure 1.3.1 n, and n» are the two possible vectors orthogonal 
to both a and b, and with norm ||al| |b|] | sin 8|. 


Which vector represents a x b, n; or —n;? The answer is n; . Try a few cases such 
as k = i x j to verify this. The following “right-hand rule" determines the direction 
of a x b in general. Take your right hand and place it so your fingers curl from a 
toward b through the acute angle 6, as in Figure 1.3.2. Then your thumb points in the 
direction of a x b. 


Figure 1.3.2 The right-hand rule for determining in which of the two 
possible directions a x b points. 
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'The Cross Product 
Geometric definition: a x b is the vector such that: 


(1) lla x bl] = |lall||b|| sin6, the area of the parallelogram spanned by a and b 
(8 is the angle between a and b;0 < 0 < 7r); see Figure 1.3.3. 


(2) ax b is perpendicular to a and b, and the triple (a, b, a x b) obeys the 
right-hand rule. 


Component formula: 
i j k 
(aii + aj + a3k) x (bii + boj + b3k) = |a] a a3 
bi b b 


= (mb; — asb5)i — (ayb3 — asb1)j + (aıb2 — azb1)k. 
Algebraic rules: 
1. ax b= 0 ifand only if a and b are parallel or a or b is zero. 
axb=-—bxa. 
ax(b+c)=axb+t+axe. 
(a+b)xc=axc+bxe. 
(aa) x b = a(a x b). 


wR YN 


Multiplication table: 


Second factor 
x i j k 


First j|-k 0 i 
factor k j —i 0 


length = |jb]] |sin 6] 


Figure 1.3.3 The length of a x b is the area of 
the parallelogram formed by a and b. 
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WIJBE Find the area of the parallelogram spanned by the two vectors 
a=i+ 2j + 3kandb = —i — k. 


SOLUTION We calculate the cross product of a and b by applying the com- 


ponent or determinant formula, with a; = 1, a2 —2, a3 —3, bi 1, b2 =0, 
bi =-l: 
a x b = [(2)(-]) — GX0)li + [G)(—D — CFD] + [)(0) — (7 DIK 
= —2i — 2j + 2k. 


Thus, the area is 


la x bl] = (72? - (2? + 2} 2243. A 


ID CWTIUEOI Find a unit vector orthogonal to the vectors i + j and j + k. 


SOLUTION A vector perpendicular to both i + j and j + k is their cross product, 
namely, the vector 


i j k 
G+)*xG4+W=|1 1 0-i-j4k 
0. l1 1 
Because ||i — j + k|| = 4/3, the vector 
ld j+k) 
Va J 


is a unit vector perpendicular toi+jandj+k. A 


GVB Derive an identity relating the dot and cross products from the 
formulas 


lu x vi] = |Jull||vl| sin 8 and u-v = |lull|lv|| cose 
by eliminating 0. 


SOLUTION Seeing sin@ and cos@ multiplied by the same expression suggests 
squaring the two formulas and adding the results. We get 


lu x vi? + (u - v? = lul liv] (si? 6 + cos? 8) = ful? lvl, 
so 
lu x vl? = [ul liv? — (u - v. 


This identity is interesting because it establishes a link between the dot and cross 
products. A 


48 


The Geometry of Euclidean Space 


Geometry of Determinants 


Using the cross product, we may obtain a basic geometric interpretation of 2 x 2 
and 3 x 3 determinants. Let a = aji + a?j and b = bii + b2j be two vectors in the 
plane. If 0 is the angle between a and b, we have seen that ||a x b|| = |lal|||b|||sin 8| 
is the area of the parallelogram with adjacent sides a and b. The cross product as a 
determinant is 


i j k 
axb=ja a 0|— 
bi bz 0 


a; a2 
bi b 


Thus, the area ||a x b|| is the absolute value of the determinant 


Geometry of 2 x 2 Determinants The absolute value of the determinant 


b, »|is the area of the parallelogram whose adjacent sides are the vectors 


a — aji + aj and b = bii + b;j. The sign of the determinant is + when, ro- 
tating in the counterclockwise direction, the angle from a to b is less than zr. 


IC WIESE Find the area of the triangle with vertices at the points (1, 1), (0, 2), 
and (3, 2) (Figure 1.3.4). 


Figure 1.3.4 (a) Find the area A of the shaded triangle by expressing the sides as 
vector differences (b) to get A = ||(b — a) x (c — a)||/2. 


SOLUTION Let a=i+j,b = 2j, and c = 3i + 2j. It is clear that the triangle 
whose vertices are the endpoints of the vectors a, b, and c has the same area as the 
triangle with vertices at 0, b — a, and c — a (Figure 1.3.4). Indeed, the latter is merely a 
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translation of the former triangle. Because the area of this translated triangle is one-half 
the area of the parallelogram with adjacent sides b — a i+j,ande—a=2i+j, 
we find that the area of the triangle with vertices (1, 1), (0, 2), and (3, 2) is the absolute 
value of 


1]|-1 1| 3 
2| a i=" 
thatis, 3/2. A 


There is an interpretation of determinants of 3 x 3 matrices as volumes that is 
analogous to the interpretation of determinants of 2 x 2 matrices as areas. 


Geometry of 3 x 3 Determinants The absolute value of the determinant 


aq, a a 
D= |b, b; b 
€( a G 


is the volume of the parallelepiped whose adjacent sides are the vectors 


a=ai+aj+ak, b=bitbj+b3k, and e=citeaj+ok. 


To prove the statement in the box above, we refer to Figure 1.3.5 and note that the length 
of the cross product, namely, ||a x b||, is the area of the parallelogram with adjacent 
sides a and b. Moreover, (a x b)-c = [|a x bl||[e|| cos y, where y is the angle that 
c makes with the normal to the plane spanned by a and b. Because the volume of the 
parallelepiped with adjacent sides a, b, and c is the product of the area of the base 
lla x b|| and the altitude ||c||| cos y|, it follows that the volume is |(a x b)- c)]. We 
saw earlier that (a x b) -c = D, so the volume equals the absolute value of D. 


Figure 1.3.5 The volume of the parallelepiped 
spanned by a, b, c is the absolute value of the 
determinant of the 3 x 3 matrix having a, b, c 
as its rows. 


er 
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GWE HS) Find the volume of the parallelepiped spanned by the three vectors 
i + 3k, 2i + j — 2k, and 5i + 4k. 


SOLUTION The volume is the absolute value of 
1 9 3 
2 1 -2|. 
5 0 4 


If we expand this determinant by minors by going down the second column, the only 
nonzero term is 


13 
i jou. 


so the volume equals 11. A 


Equations of Planes 


Let P bea plane in space, Po = (xo, yo, zo) a point on that plane, and suppose that n = 
Ai + Bj + Ck is a vector normal to that plane (see Figure 13.6). Let P = (x,y,z) 
bea point in IR?. Then P lies on the plane if and only if the vector PoP = (x — xo)i + 
(vy — yo)j + (Z — zo)k is perpendicular to n, that is, PoP -n = 0, or, equivalently, 


(Ai + Bj + Ck) - [(x — xo)i + (v — yo)j + (Z — zo)k] = 0. 


Figure 1.3.6 The points P of the plane : through Po and 
perpendicular to n satisfy the equation PoP - n = 0. 


Thus, 


A(x — xo) + B(y — yo) + C(z — zo) = 0. 
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Equation ofa Plane in Space The equation ofthe plane P through (xo, yo. zo) 
that has a normal vector n = 4i + Bj + Ck is 


A(x — xo) + B(y — yo) + Cz — zo) = 0; 
that is, (x, y, z) € P if and only if 


Ax + By+Cz+D=0 


where D = — Axo — Byo — Czo. 


The four numbers A, B, C, and D are not determined uniquely by the plane P. To 
see this, note that (x, y, z) satisfies the equation Ax + By + Cz + D = 0 if and only 
if it also satisfies the relation 


(AA)x + (AB)y + (AC)z + (AD) 20 


for any constant A Æ 0. Furthermore, if A, B, C, D and A’, B', C’, D' determine the 
same plane P, then A = AA’, B = XAB', C = AC', D = XD' for a scalar A. Conse- 
quently, A, B, C, D are determined by P up to a scalar multiple. 


DGA Determine an equation for the plane that is perpendicular to the 
vector i + j + k and contains the point (1, 0, 0). 


SOLUTION Using the general form A(x — xo) + B(y — yo) + C(z — zo) = 0, 
the plane is (x — 1) + 1(y — 0) + 1(z — 0) = 0; that is, x +y --z —1. A 


[9.6 $7028 IY Find an equation for the plane containing the three points 
(1, 1, 1), (2, 0, 0), and (1, 1, 0). 


SOLUTION Method 1. This is a “brute force" method that you can use if 
you have forgotten the vector methods. The equation for any plane is of the form 
Ax + By + Cz + D = 0. Because the points (1, 1, 1), (2, 0, 0), and (1, 1, 0) lie in 
the plane, we have 
A+B+C+D=0, 
2A + D=0, 


A+B +D=0. 


Proceeding by elimination, we reduce this system of equations to the form 
24+D=0 (second equation) 
2B+D=0 (2x third — second), 
C=0 (first — third). 
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Because the numbers A, B, C, and D are determined only up to a scalar multiple, we 
can fix the value of one of them, say A = 1, and then the others will be determined 
uniquely. We get 4 — 1, D — —2, B — 1, C — 0. Thus, an equation of the plane that 
contains the given points is x + y — 2 — 0. 

Method 2. Let P = (1, 1, 1), Q = (2, 0, 0), R = (1, 1, 0). Any vector normal to 
the plane must be orthogonal to the vectors QP and RP, which are parallel to the 
plane, because their endpoints lie on the plane. Thus, n = QP x RP is normal to the 
plane. Computing the cross product, we have 


Because the point (2, 0, 0) lies on the plane, we conclude that the equation is given 
by (x — 2)+(y — 0) +0- (z — 0) = 0; that is, x + y—-2=0. A 


Two planes are called parallel when their normal vectors are parallel. Thus, 
the planes 4j1x + Bıy + Cız + Dı = 0 and 42x + Boy + Coz + Dz = 0 are par- 
allel when n; = 41i + Bij + Cık and n? = 42i + Boj + Cok are parallel; that is, 
n; = cn» for a constant o. For example, the planes 


x—2y+z=0 and —2x +4y —2z= 10 
are parallel, but the planes 
x—2y+2=0 and 2x —2y+z=10 


are not parallel. 


Distance: Point to Plane 


Let us now determine the distance from a point E = (xi, yi, zi) to the plane P 
described by the equation A(x — xo) + B(y — yo) + C(z — zo) = Ax + By + Cz + 
D = 0. To do so, consider the unit normal vector 


_ Aid Bj Ck 
AP + B+ cU 


which is a unit vector normal to the plane. Drop a perpendicular from E to the 
plane and construct the triangle REQ shown in Figure 1.3.7. The distance d = 


ear 
|EQI| is the length of the projection of v = RE (the vector from R to E) onto n; 
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e E-G 91521) 


Figure 1.3.7 The geometry for determining the 
distance from the point E to plane 7. 


thus, 


Distance = |v -n| = |[Gxi — xo)i + (yı — yo)j + (1 — zo)k] - n| 
|A(xı — xo) + B1 — yo) + €Gi — zo)l 
AA + BP + C2 i 


If the plane is given in the form Ax + By + Cz + D = 0, then for any point 
(Xo; Yo. zo) on it, D = —( Axo + Byo + Czo). Substitution into the previous formula 
gives the following: 


Distance from a Point to a Plane The distance from (xi, yi, zi) to the plane 
Ax + By - Cz c D=0is 


|Ax; + By; + Cz; + DJ 
APB? + C2 


Distance 


PGI HS) Find the distance from Q = (2, 0, —1) to the plane 3x — 2y + 
Sz] 

SOLUTION We substitute into the formula in the preceding box the values x; — 2 
yı —0,z; = —1 (the point) and A = 3, B = —2, C = 8, D = 1 (the plane) to 
give 


[3-24+(—2)-04+8(-1)4+1| l-1] 1 
J33 + (23 + 8 Jm T 


Distance 
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The Origins of the Vector, Scalar, Dot, and Cross Products 


QUADRATIC EQUATIONS, CUBIC EQUATIONS, AND IMAGINARY 
NUMBERS. We know from Babylonian clay tablets that this great 
civilization possessed the quadratic formula, enabling them (in verbal form) 
to solve quadratic equations. Because the concept of negative numbers had to 
wait until the sixteenth century to see the light of day, the Babylonians did 
not consider either negative (or imaginary) solutions. 

With the Renaissance and the rediscovery of ancient learning, Italian 
mathematicians began to wonder about the solutions of cubic equations, 

x? + ax? + bx +c = 0, where a, b, and c are positive numbers. 

Around 1500, Scipione del Ferro, a professor in Bologna (the oldest 
European university) was able to solve cubics of the form x? -- bx — c, but 
kept his discovery secret. Before his death, he passed his formula to his 
successor, Antonio Fior, who for a while also kept the formula to himself. It 
remained a secret until a brilliant, self-taught mathematician named Nicolo 
Fontana, also known as Tartaglia (the stammerer), appeared on the scene. 
Tartaglia claimed he could solve the cubic, and Fior felt he needed to protect 
the priority of del Ferro, and so in response challenged Tartaglia to a public 
competition. 

We are told that Tartaglia was able to solve all of the thirty cubic 
equations posed by Fior. Amazingly, some scholars believe that Tartaglia 
discovered the formula for solutions to x? + cx = d only days before the 
contest was to take place. 

The greatest mathematician of the sixteenth century, Gerolamo 
Cardano (1501-1576)—a Renaissance scholar, mathematician, 
physician, and fortuneteller—gave the first published solution of the general 
cubic. Although born of modest means, he (like Tartaglia) rose, through 
effort and natural brilliance, to great fame. Cardano is the author of 
the first book on games of chance (marking the beginning of modern 
probability theory) and also of Ars Magna (the Great Art), which marks 
the beginning of modern algebra. It was in this book that Cardano 
published the solution to the general cubic x? + ax? + bx +c = 0. 

How did he get it? 

While working on his algebra book, and aware that Tartaglia was able to 
solve forms of cubic, Cardano, in 1539, wrote to Tartaglia asking for a 
meeting. After some cajoling, Tartaglia agreed. It was at this meeting that, in 
exchange for a pledge of secrecy (and we know how these generally go), 
Tartaglia revealed his solution, from which Cardano was able to derive a 
solution to the general equation, which then appeared in Ars Magna. Feeling 
betrayed, Tartaglia led a scathing attack on Cardano, leading to a small soap 
opera. 
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un 


What is important for us, at the moment, is that as a consequence of the 
method of solution, something very strange occurred. Consider the cubic 
x? — 15x — 4. Itsonly positive root is 4. However, the Tartaglia- Cardano 
solution formula yields 


xcu 1704. 42 x ESI (1) 


as the positive root. Thus, this number must be equal to 4. Yet this must be 
nonsense, because inside the cube root we are taking the square root of a 
negative number—at the time, an absolute impossibility. This was a real 
shock. Over 100 years later, in 1702, when Leibniz, codiscoverer of calculus, 
showed the great Dutch scientist Christian Huygens the formula 


V6 2 it V/-3-4 J1- V/-3 Q) 


Huygens was completely flabbergasted, and remarked that this equality 
“defies all human understanding." [Try, informally, to verify both formulas 
(1) and (2) for yourself.] 

Whether nonsense or not, Tartaglia and Cardano's formula forced 
mathematicians to confront square roots of negative numbers (or imaginary 
numbers, as they are called today). This historical incident is another 
example that negates the (widespread) view that mathematics is "made 
up" by mathematicians. As is often the case, it is the mathematics itself that 
speaks to us. 


THE MATURING OF COMPLEX NUMBERS. For well over two centuries, 
numbers like i = JA were looked at with great suspicion. The square root 
of any negative number can be written in terms of i; for example, 

J=a = J(a—1) = Ja,/—1. In the middle of the eighteenth century, the 
Swiss mathematician Leonhard Euler connected the universal cosmic 
numbers e and zt with the imaginary number i. Whatever i was or meant, it 
necessarily follows that 


that is, e "raised to the power mi equals —1. Thus, these cosmic numbers, 
reflecting perhaps some deeper mystery, are in fact connected to each other 
by a very simple formula. 

At the beginning of the nineteenth century, the German mathematician 
Karl Friedrich Gauss was able to prove the fundamental theorem of algebra, 
which says that any nth-degree polynomial has n roots (some or all of which 
may be imaginary; that is, the roots have the form a + bi, where, as earlier, 

i = J/—1 and where a and b are real numbers). 

By the middle of the nineteenth century, the French mathematician 
Augusten-Louis Cauchy and the German mathematician Bernhard 
Riemann had developed the differential calculus for functions of one 
complex variable. An example of such a function is F(z) — z", where 


v 
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z= a + bi. In this case, the usual formula for the derivative, F'(z) =nz"—', 


still holds. However, by introducing imaginary numbers, Cauchy was able to 
evaluate “real integrals” that heretofore could not be evaluated. For example, 
it is possible to show that 


and that 
if log sin x dx = —7 log 2, 
0 


These were stunning results. 

In summary, the solution of the cubic equation, the fundamental 
theorem of algebra, and the evaluation of real integrals proved how valuable 
it was to consider imaginary numbers a + bi, even though they were not (at 
least not yet) on terra firma. Did they really exist or were they simply 
phantoms of our imagination, and thus truly imaginary? 


HAMILTON'S DEFINITION OF COMPLEX NUMBERS. Many 
mathematicians after Cardano made important contributions to imaginary 
(or complex) numbers, including Argand, Wessel, and Gauss—all of whom 
represented them geometrically. However, the modern, intellectually 
rigorous definition of a complex number is due to the great Irish 
mathematician William Rowan Hamilton (see Figure 1.3.8). After Newton, 
who created the vector concept through his invention of the notion of force, 
Hamilton was, beyond any doubt, the most important and singular figure in 


Figure 1.3.8 Sir William Rowan 
Hamilton (1805-1865). 
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the development of vector calculus. It was Hamilton who gave us the terms 
vector and scalar quantity. 

William Rowan Hamilton was born in Dublin, Ireland, at midnight on 
August 3, 1805. In 1823, he entered Trinity College, Dublin. His university 
career, by any standard, was phenomenal. By his third year, Trinity offered 
him a professorship, the Andrew's Chair of Astronomy, and the State named 
him Royal Astronomer of Ireland. These honors were based on his 
theoretical prediction (in 1824) of two entirely new and unexpected optical 
phenomena, namely, internal and external conical refraction. 

By 1827 he had become interested in imaginary numbers. He wrote that 
“the symbol J- is absurd, and denotes an impossible extraction . . .” He set 
out to put the idea of a complex number on a firm logical foundation. His 
solution was to define a complex number a 4- bi as a point (a, b) in the plane 
R?, much as we do today. Thus, the imaginary number bi for Hamilton was 
simply the point (0, b) on the y axis. The difference between complex 
numbers and the Cartesian plane was that Hamilton followed the proforma 
multiplication of complex numbers: 


(a + biY(c + di) = (ac — bd) + (ad + bc)i, 
and defined a new multiplication on the complex plane: 
(a, b) - (c, d) = (ac — bd, ad + bc). 


Thus, i = y—1 just disappears into the point (0, 1), and the mystery and 
confusion over complex numbers disappears along with it. 


FROM COMPLEX NUMBERS TO QUATERNIONS. From Hamilton's 
interpretation, complex numbers are nothing more than the extension of real 
numbers into a new dimension, two dimensions. Hamilton, however, also did 
fundamental work in mechanics, and he knew well that two dimensions were 
too limiting for the space analysis necessary for understanding the physics of 
the three-dimensional world. Therefore, Hamilton set out to find a triplet 
system; that is, an acceptable! multiplication scheme on points (a,b,c) in IR?, 
or, as it were, on vectors ai + bj + ck. 

By 1843, Hamilton realized that his quest was hopeless. But then, on 
October 16, 1843, Hamilton discovered that what he could not achieve for 
IR? he could achieve for R^; he discovered quaternions, an entirely new 
number system. 


'For him, "acceptable" meant that the associative law of multiplication would hold. 
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Let us revisit the important historical moment in Hamilton's own 
words: 


But on the 16 day of the same month— which happened to be a 
Monday, and a Council Day of the Royal Irish Academy—I was 
walking in to attend and preside, and your mother was walking 
with me, along the Royal Canal, to which she had perhaps driven; 
and although she talked with me now and then, yet another 
under-current of thought was going on in my mind, which gave at 
last a result, whereof it is not too much to say that I felt at once the 
importance. An electric circuit seemed to close; and a spark flashed 
forth, the herald (as I foresaw, immediately) of many long years to 
come of definitely directed thought and work, by myself if spared, 
and at all events on the parts of others, if I should even be allowed to 
live long enough to distinctly communicate the discovery. Nor could 
I resist the impulse—unphilosophical as it may have been—to cut 
with a knife on a stone of Brougham Bridge, as we passed it, the 
fundamental formula with the symbols i, j,k; namely 


v=; kik 1, 


which contains the Solution of the Problem, but of course, as an 
inscription, has long since moldered away.” 


Hamilton had realized that the multiplication he had been searching for 
could be introduced on 4-tuples (a, b, c, d), which he had denoted by 


a+bi+cj+dk. 


The a was called the scalar part and bi + cj + dk was called the vector 
part, which in reality, as with complex numbers, meant the point (a, b, c, d) 
in R^. The multiplication table he introduced was 


iįj=k= -ji 
ki =j = —ik 
jk =i=—kj 


Hamilton continued to passionately believe in his quaternions until the 
end of his life. Unfortunately, historical development went in another 
direction. 


?North British Review 14 (1858): 57. 
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The first step away from the quaternions was in fact taken by a 
firm believer in the importance of quaternions, namely, Peter Guthrie Tait, 
who was born in 1831 near Edinburgh, Scotland. In 1860, Tait was 
appointed to the Chair of Natural Philosophy at Edinburgh University, 
where he remained until his death in 1901. In 1867, he wrote his Elementary 
Treatises on Quaternions, a text stressing physical applications. His third 
chapter was most significant. It was here that Tait looked at the quaternionic 
product of two vectors: 


v=aitbj+ck and — wai bj 4 c'k. 
Then the product vw, as defined by Hamilton, yields: 
(ai+ bj + ck)(a'i 4- b'j 4- c'k) 
= —(aa' + bb’ + cc") + (be! — cb’)i + (ac’ — ca’)j + (ab — ba^)k 
or, in modern form: 
vw =-—(v-w)+vxw, 


where - is the modern dot or inner product of vectors and x is the cross 
product. Tait discovered the formulas 


v- w= Ilvilllwi|cos@ ^ and |v x wli = [vill wil sind, 


where @ is the angle formed by v and w. Moreover, he showed that v x w was 
orthogonal to v and w, therefore giving a geometric interpretation of the 
quaternionic product of two vectors. 

This began the move away from the study of quaternions and back to 
Newton's vectors, with the quaternionic product eventually being replaced 
by two separate products, the inner product and the cross product. 

By the way, you might wonder why Hamilton did not at first discover the 
cross product, since it is a product on IR?. The reason is that it did not have a 
fundamental property that he required—namely, it was not associative:? 


O0O=(ixi)xk#ix(ixk)=—k 


Interestingly, if one is willing to continue to live with nonassociativity, there is also a vector product with most 
of the properties of the cross product in R7; this involves yet another number system called the octonians, which 
exists in R®. The nonexistence of a cross product in other dimensions is a result that goes beyond the scope 
of this text. For further information, see the American Mathematical Monthly, 74 (1967), pp. 188-194, and 90 
(1983), p. 697, as well as J. Baez, “The Octonians,” Bulletin of the American Mathematical Society, 39 (2002), 
pp. 145-206. One can show that systems like the quaternions and octonians occur only in dimension 1 (the reals 
R), dimension 2 (the complex numbers), dimension 4 (the quaternions), and dimension 8 (the octonians). On 
the other hand, the "right"way to extend the cross product is to introduce the notion of differential forms, which 
exists in any dimension. We discuss their construction in Section 8.6. 
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THE MOVE AWAY FROM QUATERNIONS. The scientists ultimately 
responsible for the demise of quaternions were James Clerk Maxwell 

(see Figure 1.3.9), Oliver Heaviside, and Josiah Willard Gibbs, a founder of 
statistical mechanics. In the 1860s, Maxwell wrote down his monumental 
equations of electricity and magnetism. No vector notation was used (it did 
not exist). Instead, Maxwell wrote out his equations in what we would now 
call “component form.” Around 1870, Tait began to correspond with 
Maxwell, piquing his interest in quaternions. 


Figure 1.3.9 James Clerk 
Maxwell (1831-1879). 


In 1873, Maxwell published his epic work, Treatise on Electricity and 
Magnetism. Here (as we shall do in Chapter 8), Maxwell wrote down the 
equations of the electromagnetic field using quaternions, thus motivating 
physicists and mathematicians alike to take a closer look at them. From this 
manuscript many have concluded that Maxwell was a supporter of the 
“quaternionic approach” to physics. The truth, however, is that Maxwell 
was reluctant to use quaternions. It was Maxwell, in fact, who began the 
process of separating the vector part of a product of two quaternions (the 
cross product) from its scalar part (the dot product). 

It is known that Maxwell was troubled by the fact that the scalar part of 
the “square” of a vector (vv) was always negative (—v - v), which in the case 
of a velocity vector could be interpreted as negative kinetic energy—an 
unacceptable idea! 

It was Heaviside and Gibbs who made the final push away from 
quaternions. Heaviside, an independent researcher interested in electricity 
and magnetism, and Gibbs, a professor of mathematical physics at Yale, 
almost simultaneously—and independently—crcated our modern system of 
vector analysis, which we have just started to study. 
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In 1879, Gibbs taught a course at Yale in vector analysis with. 
applications to electricity and magnetism. This treatise was clearly motivated 
by the advent of Maxwell'sequations, which we will be studying in Chapter 
8. In 1884, he published his Elements of Vector Analysis, a book in which 
all the properties of the dot and cross products are fully developed. 

Knowing that much of what Gibbs wrote was in fact due to Tait, Gibbs's 
contemporaries did not view his book as highly original. However, it is one of 
the sources from which modern vector analysis has come into existence. 

Heaviside was also largely motivated by Maxwell's brilliant work. His 
great Electromagnetic Theory was published in three volumes. Volume I 
(1893) contained the first extensive treatment of modern vector analysis. 

We all owe a great debt to E. B. Wilson's1901 book Vector Analysis: A 
Textbook for the Use of Students of Mathematics and Physics Founded upon the 
Lectures of J. Willard Gibbs. Wilson was reluctant to take Gibbs’s course, 
because he had just completed a full-year course in quaternions at Harvard 
under J. M. Pierce, a champion of quaternionic methods; but he was forced 
by a dean to add the course to his program, and he did so in 1899. Wilson 
was later asked by the editor of the Yale Bicentennial Series to write a book 
based on Gibbs's lectures. For a picture of Gibbs and for additional 
historical comments on divergence and curl, see the Historical Note in 
Section 4.4. 


EXERCISES 


1. 


Verify that interchanging the first two rows of the 3 x 3 determinant 


[SN 


1 
1 
2 


Coon 


changes the sign of the determinant. 


2. 


Evaluate the determinants 
(922 -1 0 ()|]] 4 9 
4 32 4 9 16 
3 0 1 9 16 25 
(b)|36 18 17 (0|2 3 5 
45 24 20 7 WW 13 
3 5 -2 17 19 23 


Compute a x b, where a = i — 2j + k, b = 2i+j+k. 
Compute a - (b x c), where a and b are as in Exercise 3 and c = 3i — j + 2k. 


Find the area of the parallelogram with sides a and b given in Exercise 3. 


62 The Geometry of Euclidean Space 


6. A triangle has vertices (0, 0, 0), (1, 1, 1), and (0, —2, 3). Find its area. 
7. What is the volume of the parallelepiped with sides 2i + j — k, 5i — 3k, and i — 2j + k? 


8. What is the volume of the parallelepiped with sides i, 3j — k, and 4i + 2j — k? 


In Exercises 9 to 12, describe all unit vectors orthogonal to both of the given vectors. 
9. iij 
10. —5i 4- 9j — 4k, 7i+ 8j + 9k 
11. —5i 4- 9j — 4k, 7i + 8j + 9k, 0 
12. 2i — 4j + 3k, —4i + 8j — 6k 
13. Compute u + v, u- v, |lull, ||vll, and u x v, where u = i — 2j + k, v = 2i — j + 2k. 
14. Repeat Exercise 13 for u = 3i + j — k, v = —6i — 2j — 2k. 


15. Find an equation for the plane that 


(a) is perpendicular to v = (1, 1, 1) and passes through (1, 0, 0). 

(b) is perpendicular to v — (1, 2, 3) and passes through (1, 1, 1). 

(c) is perpendicular to the line I(t) = (5, 0, 2)t + (3, —1, 1) and passes through 
(5, —1, 0). 

(d) is perpendicular to the line I(t) = (—1, —2, 3)t + (0, 7, 1) and passes through 
(2,4, -1). 
16. Find an equation for the plane that passes through 


(a) (0,0, 0), (2, 0, — 1), and (0, 4, —3). 
(b) (1,2, 0), (0, 1, —2), and (4, 0, 1). 
(c) (2, —1, 3), (0, 0, 5), and (5, 7, —1). 


17. (a) Show that two parallel planes are either identical or they never intersect. 
(b) How do two nonparallel planes intersect? 


18. Find the intersection of the planes x + 2y +z = 0 and x — 3y — z = 0. 
19. Find the intersection of the planes x + (y — 1) --z = 0and —x + (y + 1) -z — 0. 


20. Find the intersection of the two planes with equations 3(x — 1) + 2y + (z + 1) = 0 and 
(x — 1) 4y - € * 1) — 0. 


21. (a) Prove the two triple-vector-product identities 
(a x b) x e—(a:c)b — (b - c)a and a x (b x c) — (a:c)b — (a: b)c. 


(b) Prove (u x v) x w—u x (v x w) ifand only if (u x w) x v — 0. 
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(c) Also prove that (u x v) x w + (v x w) x u + (w x u) x v = 0 (called the Jacobi 
identity). 


22. (a) Prove, without recourse to geometry, that 


u-(vx w) = v- (w x u) = w- (u x v) = —u-(w x v) 


= —w:(v x u) = —v-(u x w). 


(b) Use part (a) and Exercise 21(a) to prove that 


(u xv) (w x v) = (uu): v) uvu v) = s 


23. Verify Cramer’s rule. 


24. Find an equation for the plane that passes through the point (2, —1, 3) and is 
perpendicular to the line v = (1, —2, 2) + t(3, —2, 4). 


25. Find an equation for the plane that passes through the point (1, 2, —3) and is 
perpendicular to the line v = (0, —2, 1) + (1, —2, 3). 


26. Find the equation of the line that passes through the point (1, —2, —3) and is 
perpendicular to the plane 3x — y — 2z + 4 = 0. 


27. Find an equation for the plane containing the two (parallel) lines 
vi = (0, 1, 2) + ¢(2, 3, —1) and v2 = (2, —1, 0) + (2,3, —1). 
28. Find the distance from the point (2, 1, —1) to the plane x — 2y + 2z + 5 = 0. 


29. Find an equation for the plane that contains the line v = (—1, 1, 2) + ¢(3, 2, 4) and is 
perpendicular to the plane 2x + y — 3z + 4 = 0. 


30. Find an equation for the plane that passes through (3, 2, —1) and (1, —1, 2) and that is 
parallel to the line v = (1, —1, 0) + £(3, 2, —2). 


31. Redo Exercises 19 and 20 of Section 1.1 using the dot product and what you know about 
normals to planes. 


32. Given vectors a and b, do the equations x x a = b and x - a = ||a|| determine a unique 
vector x? Argue both geometrically and analytically. 


33. Determine the distance from the plane 12x + 13y + 5z + 2 = 0 to the point (1, 1, —5). 


34. Find the distance to the point (6, 1, 0) from the plane through the origin that is 
perpendicular to i — 2j 4- k. 


35. (a) In mechanics, the moment M of a force F about a point O is defined to be the 
magnitude of F times the perpendicular distance d from O to the line of action of F. The 
vector moment M is the vector of magnitude M whose direction is perpendicular to the plane 
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of O and F, determined by the right-hand rule. Show that M — R x F, where R is any vector 
from O to the line of action of F. (See Figure 1.3.10.) 


Figure 1.3.10 Moment of a force. 


Line of action 


(b) Find the moment of the force vector F = i — j + 2k newtons about the origin if the 
line of action is x = 1 +t, y = 1 — t,z — 2t. 


36. Show that the plane that passes through the three points A = (a), a2, a3), B = 
(bi, b2, b3), and C = (ci, c2, c3) consists of the points P = (x, y, z) given by 


d|—X d2—y a-z 
bj —x b;—y by—z|—0. 
C€(—X a-y Gz 


(Hnr: Write the determinant as a triple product.) 

37. Two media with indices of refraction n; and n, are separated by a plane surface 
perpendicular to the unit vector N. Let a and b be unit vectors along the incident and 
refracted rays, respectively, their directions being those of the light rays. Show that 


n\(N x a) = m(N x b) by using Snell law, sin 6; / sin 02 = n2/n,, where 6, and @ are the 
angles of incidence and refraction, respectively. (See Figure 1.3.11.) 


light ray 


Figure 1.3.11 Snell's law. 
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38. Justify the steps in the following computation: 


123] 2 3 l2 3,534 | 
45 6/=|0 -3 -6-J|o -3 -s- Ajo -36- 
7 8 || [7 8 10 fo -6 -H 


39. Show that adding a multiple of the first row of a matrix to the second row leaves the 
determinant unchanged; that is, 


a by e a b c 
a+ìa, bjicÀbi &a+ica|=|a bj cl. 
a3 bs C3 a3 by 65 


[In fact, adding a multiple of any row (column) of a matrix to another row (column) leaves the 
determinant unchanged.] 
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A standard way to represent a point in the plane IR? is by means of rectangular 
coordinates (x , y). However, as the reader has probably learned in elementary calculus, 
polar coordinates in the plane can be extremely useful. As portrayed in Figure 1.4.1, 
the coordinates (r, 0) are related to (x, y) by the formulas 

x =rcosé and y —rsin6, 


where we usually take r > 0 and 0 < 0 < 27. 


/ Y Figure 1.4.1 The polar coordinates of (x, y) are (r, 6). 
———— X 


Readers not familiar with polar coordinates are advised to study the relevant 
section of their calculus texts. We now set forth two ways of representing points in 
space other than by using rectangular Cartesian coordinates (x, y, z). These alternative 
coordinate systems are particularly well suited for certain types of problems, such as 
the evaluation of integrals using a change of variables. 
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— ioa Note 


| 


In 1671, Isaac Newton wrote a manuscript entitled The Method of Fuxions 
and Infinite Series, which contains many uses of coordinate geometry to 
sketch the solutions of equations. In particular, he introduces the polar 
coordinate system, among various other coordinate systems. 


In 1691, Jacob Bernoulli published a paper also containing polar 
coordinates. Because Newton's manuscript was not published until after his“ 
death in 1727, credit for the discovery of polar coordinates is usually 
attributed to Bernoulli. 


Cylindrical Coordinates 


DEFINITION The cylindrical coordinates (r, 0, z) of a point (x, y, z) are 


| defined by (see Figure 1.4.2) 


| x —r cos6, y —rsin6, Ze. (1) 


e 2) 


Figure 1.4.2 Representing a point (x, y, z) in terms of its 


d cylindrical coordinates r, 0, and z. 


p SERIE o 


eG 


To express r, 0, and z in terms of x, y, and z, and to ensure that 0 lies between 0 and 
27, we can write 


tan! (y/x) ifx > Oandy >0 


r=Ęx?+y?,  O=4n+tan!(y/x) ifx <0 REZ, 
2n + tan (y/x) ifx > Oandy <0, 


1.4 Cylindrical and Spherical Coordinates 67 


where tan^! ( y/x) is taken to lie between —7:/2 and 2/2. The requirement that 0 < 
0 < 21 uniquely determines 0 andr > 0 for a given x and y. If x = 0, then0 = 2/2 
for y > 0 and 37/2 for y < 0. If x = y = 0,0 is undefined. 

In other words, for any point (x, y, z), we represent the first and second co- 
ordinates in terms of polar coordinates and leave the third coordinate unchanged. 
Formula (1) shows that, given (r, 0, z), the triple (x, y, z) is completely determined, 
and vice versa, if we restrict 0 to the interval [0, 27r) (sometimes the range (—7, x] 
is convenient) and require that r > 0. 

To see why we use the term cylindrical coordinates, note that if the conditions 
0 < 0 «23, —oo <z < co hold and if r =a is some positive constant, then the 
locus of these points is a cylinder of radius a (see Figure 1.4.3). 


Figure 1.4.3 The graph of the points whose cylindrical 
coordinates satisfy r — a is a cylinder. 


(a) Find and plot the cylindrical coordinates of (6, 6, 8). (b) If a 
point has cylindrical coordinates (8, 27/3, —3), what are its Cartesian coordinates? 


Plot. 


SOLUTION For part (a), we have r = 4/6? + 6? = 64/2 and 0 = tan! (6/6) = 
tan (1) = 7/4. Thus, the cylindrical coordinates are (6/2, 71/4, 8). This is point P 
in Figure 1.4.4. For part (b), note that 27/3 = 2/2 + 2/6 and compute 


2n 8 
0-8 = 4 
x= r cos C057, 5 


and 


y=rsind = sin = a8 aa. 


Thus, the Cartesian coordinates are (—4, 44/3, —3). This is point Q in the figure. A 
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Zz 


V P(6, 6, 8) 
pe 


Figure 1.4.4 Some examples of 
the conversion between Cartesian 
and cylindrical coordinates. 


$ 
Q(-A, 43, -3) 


Spherical Coordinates 


Cylindrical coordinates are not the only possible generalization of polar coordinates 
to three dimensions. Recall that in two dimensions the magnitude of the vector xi + yj 
(that is, /x? + y?) is ther in the polar coordinate system. For cylindrical coordinates, 
the length of the vector xi + yj + zk, namely, 


pax +y +2, 


is not one of the coordinates of that system—instead, we used the magnitude 
r = \/x? + y?, the angle 0, and the “height” z. 

We now modify this by introducing the spherical coordinate system, which does 
use p as a coordinate. Spherical coordinates are often useful for problems that possess 
spherical symmetry (symmetry about a point), whereas cylindrical coordinates can 
be applied when cylindrical symmetry (symmetry about a line) is involved. 

Given a point (x, y, z) € IR?, let 


p=yxity +z 
and represent x and y by polar coordinates in the xy plane: 
x —rcos6, y =r sin (2) 


where r = yx? + y? and 0 is determined by formula (1) [see the expression for 6 
following formula (1)]. The coordinate z is given by 


z = pcosó, 
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where ¢ is the angle (chosen to lie between 0 and zr, inclusive) that the radius vector 
v = xi + yj + zk makes with the positive z axis, in the plane containing the vector v 
and the z axis (see Figure 1.4.5). Using the dot product, we can express ¢ as follows: 


+k Mk 
cosó = YU thai, $- cos (TF). 
lvii livi] 


(x, yz) 


Figure 1.4.5 Spherical coordinates (p, 0, $); the 
graph of points satisfying p = a is a sphere. 


We take as our coordinates the quantities p, 0, p. Because 
r=psing, 


we can use formula (2) to find x, y, z in terms of the spherical coordinates p, 6, $. 


DEFINITION The spherical coordinates of (x, y, z) is the triple (o, 6, $), 
defined as follows: 


x = psin$ cos6, y = psin sin, z = pcosó (3) 
where 


p20, 0x80 «2, 0xó-zm. 


celi sif Note 


In 1773, Joseph Louis Lagrange was working on Newton’s gravitational 
theory as it applied to ellipsoids of revolution. In attempting to calculate the 
total gravitational attraction of such an ellipsoid, he encountered an integral 
that was difficult to evaluate. Motivated by this application, he introduced 
spherical coordinates, which allowed him to calculate the integral. We will be 
discussing the method of changing coordinates as it applies to multiple 
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integrals in Section 6.2, and applications to gravitation in Section 6.3, where 
we show how the inverse square law of gravity allowed Newton to consider 
spherical masses as point masses. 

Spherical coordinates are also closely connected to navigation by latitude 
and longitude. To see the connection, first note that the sphere of radius a 
centered at the origin is described by a very simple equation in spherical 


coordinates, namely, p = a. Fixing the radius a, the spherical coordinates 6 
and ¢ are similar to the geographic coordinates of longitude and latitude if we 
take the earth’s axis to be the z axis. There are differences, though: The 
geographical longitude is || and is called east or west longitude, according to 
whether 0 is a positive or negative measure from the Greenwich meridian; 
the geographical latitude is |7 /2 — | and is called north or south latitude, 
according to whether 77/2 — $ is positive or negative. 


EXAMPLE 2 


(a) Find the spherical coordinates of the Cartesian point (1, —1, 1) and plot. 


(b) Find the Cartesian coordinates of the spherical coordinate point (3, 7/6, 7/4) 
and plot. 


(c) Leta point have Cartesian coordinates (2, —3, 6). Find its spherical coordinates 
and plot. 


(d) Leta point have spherical coordinates (1, —7r /2, 1/4). Find its Cartesian coor- 
dinates and plot. 


SOLUTION 
(à) p=fery+2=J/P+ClP +l = V3, 


=l 7 
9 — 2n tw (Z) =2n e (1) =2n- T= = 


$ 0 s (=) & 0.955 ~ 54.74°. 


See Figure 1.4.6(a) and the formula for 0 following formula (1). 


(b x =psingcosé = 3sin(=) cos() = 3(=) A u x 


y =psing sin 3sin(7 sin(=) -3(+) (3) = aa 


z = pcosġ 3cos(7) - x = nm 


See Figure 1.4.6(b). 


(a) 


(1,—1,1) p= 


$ 


45 


557 
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Figure 1.4.6 Finding (a) the 
spherical coordinates of the 
point (1, 1, 1), and (b) the 
Cartesian coordinates of 

(3, 7/6, 7/4). 


() pHVx?+y? +z = J2 + (32 +6 = V49 =7, 


6=2n+ tan (7) —2z + tan! (2) & 5.3004 radians ~ 303.69°, 


x 


6 
$ = cos! (2) = cos! ($) ~ 0.541 ~ 31.0°. 


See Figure 1.4.7(a). 


Figure 1.4.7 Finding (a) the 
spherical coordinates of the point 
(2, —3, 6), and (b) the Cartesian 
coordinates of (1, —7:/2, 2/4). 


(d) x = psinó cost = 1 sin(7. cos( 2)=(2)-0=0 
jali -(z)e»- 


2 
z=pcos¢= 1eos(7) = al 


y = psin$ sin 1sin(7 


See Figure 1.4.7(b) A 
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DENISE] Express (a) the surface xz = 1 and (b) the surface x? + y? — 


z? — | in spherical coordinates. 


SOLUTION From formula (3), x = psin$ cos 8, and z = p cos, and so the 
surface xz = 1 in (a) consists of all (p, 6, $) such that 


p? sing cos0 cos ó = 1, that is, p? sin2¢ cos = 2. 
For part (b), we can write 


xy — 2? =x? 4 y? 427 — 22? = p* — 2p" cos’, 


so that the surface is p?(1 — 2 cos? $) = 1; that is, —p* cos(20) = 1. A 


Associated with cylindrical and spherical coordinates are unit vectors that are the 
counterparts of i, j, and k for rectangular coordinates. They are shown in Figure 1.4.8. 
For example, e, is the unit vector parallel to the xy plane and in the radial direction, so 
that e = (cos @)i + (sin @)j. Similarly, in spherical coordinates, eg is the unit vector 
tangent to the curve parametrized by the variable @ with the variables p and 0 held 
fixed. We shall use these unit vectors later when we use cylindrical and spherical 
coordinates in vector calculations. 


x 


Figure 1.4.8 (a) Orthonormal vectors e,, eo, and e; associated with cylindrical 
coordinates. The vector e, is parallel to the line labeled r. (b) Orthonormal vectors 
€p, €o, and eg associated with spherical coordinates. 


EXERCISES 


1. (a) The following points are given in cylindrical coordinates; express each in 
rectangular coordinates and spherical coordinates: (1, 45°, 1), (2, 7/2, —4), (0, 45°, 10), 
(3, 7/6, 4), (1, 2/6, 0), and (2, 37/4, —2). (Only the first point is solved in the Study 
Guide.) 

(b) Change each of the following points from rectangular coordinates to spherical 
coordinates and to cylindrical coordinates: (2, 1, —2), (0, 3, 4), (2, 1,1), (-245, —2,3). 
(Only the first point is solved in the Study Guide.) 
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2. Describe the geometric meaning of the following mappings in cylindrical coordinates: 
(a) (r0, z) 9 (r, 0, —z) 
(b) (r, 0,2)  (r,0 +7, —z) 
(c) (0,2) e» (—r, 0 — 7/4, z) 


3. Describe the geometric meaning of the following mappings in spherical coordinates: 


(a) (0,0, $) (0,0 4-7, $) 
(b) (p, 0,6) > (p,0, 7 — $) 
(c) (p, 8,6) +> Qp, 0 + 1/2, $) 


4. (a) Describe the surfaces r — constant, 0 — constant, and z — constant in the 
cylindrical coordinate system. 
(b) Describe the surfaces p = constant, @ = constant, and $ = constant in the spherical 
coordinate system. 


5. Show that to represent each point in R? by spherical coordinates it is necessary to take 
only values of 0 between 0 and 2x, values of $ between 0 and zr, and values of p > 0. Are 
coordinates unique if we allow p < 0? 


6. Using cylindrical coordinates and the orthonormal (orthogonal normalized) vectors 
€,, ej, and e, (see Figure 1.4.8), 


(a) express each of e,, eg, and e. in terms of i, j, k and (x, y, z); and 
(b) calculate eg x j both analytically, using part (a), and geometrically. 


7. Using spherical coordinates and the orthonormal (orthogonal normalized) vectors e, , ej, 
and ey (see Figure 1.4.8(b)), 
(a) express each of e,, eg, and e, in terms of i, j, k and (x, y, z); and 
(b) calculate e; x j and ej x j both analytically and geometrically. 


8. Express the plane z — x in (a) cylindrical, and (b) spherical coordinates. 


9. Show that in spherical coordinates: 


(a) p is the length of xi + yj + zk. 
(b) $ = cos"! (v- k/||v|]), where v = xi + yj + zk. 
(c) 0 = cos"! (u-i/|ul]), where u = xi + yj. 


10. Two surfaces are described in spherical coordinates by the two equations p = f(0, $) and 
p = —2 f (0, $), where f(0, $) is a function of two variables. How is the second surface 
obtained geometrically from the first? 


11. A circular membrane in space lies over the region x? + y? < a?. The maximum z 
component of points in the membrane is b. Assume that (x, y, z) is a point on the membrane. 
Show that the corresponding point (r, 6, z) in cylindrical coordinates satisfies the conditions 
0<r<a,0<6 z2m,|z| x b. 


12. A tank in the shape of a right-circular cylinder of radius 10 ft and height 16 ft is half 
filled and lying on its side. Describe the air space inside the tank by suitably chosen 
cylindrical coordinates. 


74 


The Geometry of Euclidean Space 


13. A vibrometer is to be designed that withstands the heating effects of its spherical 
enclosure of diameter d, which is buried to a depth d/3 in the earth, the upper portion being 
heated by the sun (assume the surface is flat). Heat conduction analysis requires a description 
of the buried portion of the enclosure in spherical coordinates. Find it. 


14. An oil filter cartridge is a porous right-circular cylinder inside which oil diffuses from 

the axis to the outer curved surface. Describe the cartridge in cylindrical coordinates, if the 
diameter of the filter is 4.5 inches, the height is 5.6 inches, and the center of the cartridge is 
drilled (all the way through) from the top to admit a 3-inch-diameter bolt. 


15. Describe the surface given in spherical coordinates by p — cos 20. 
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Vectors in n-space 


In Sections 1.1 and 1.2 we studied the spaces R = Rt, R2, and IR? and gave geometric 
interpretations to them. For example, a point (x, y, z) in R? can be thought of as a geo- 
metric object, namely, the directed line segment or vector emanating from the origin 
and ending at the point (x, y, z). We can therefore think of IR? in either of two ways: 


(i) Algebraically, as a set of triples (x, y, z) where x, y, and z are real numbers 


(ii) Geometrically, as a set of directed line segments 


These two ways of looking at IR? are equivalent. For generalization it is easier to 
use definition (i). Specifically, we can define IR", where n is a positive integer (possibly 
greater than 3), to be the set of all ordered n-tuples (x1, x2, . . . , Xn), where the x; are 
real numbers. For instance, (1, 5, 2, 4/3) € R^. 

The set R" so defined is known as Euclidean n-space, and its elements, which we 
write as X = (x1, X2, ... , Xn), are known as vectors or n-vectors. By setting n = 1,2, 
or 3, we recover the line, the plane, and three-dimensional space, respectively. 

We launch our study of Euclidean n-space by introducing several algebraic op- 
erations. These are analogous to those introduced in Section 1.1 for R? and I?. The 
first two, addition and scalar multiplication, are defined as follows: 


G) Qu. x2. x) HOi asses Yn) = Ga + V1, X2 + Y2, ss Xn + Yn); 
and 


(ii) for any real number a, 
CRY X3, eso Ky) = LN OND, e. s OD). 


The geometric significance of these operations for R? and R? was discussed in 
Section 1.1. 
The n vectors 


ei = (1,0,0, ..., 0), e2 = (0,1,0,...,0),..., 6, =(0,0,...,0, 1) 
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are called the standard basis vectors of IR", and they generalize the three mutually 
orthogonal unit vectors i, j,k of IR. The vector x = (xi, x», ..., Xn) can then be 
written as X = xj@; + x2e2 +-+- + x,e,. 

For two vectors x = (x, x», x3) and y = (yi, y2, 3) in R?, we defined the dot or 
inner product x - y to be the real number x - y = x1yi + X22 + x3ys. This definition 
easily extends to IR"; specifically, for x = (xi, 32, .... Xn), y = (1, Y2, --- Yn), we 
define the inner product of x and y to be x- y = xiyi + xoyo +--+ + x4 ys. In IR", 
the notation (x, y) is often used in place of x - y for the inner product. 

Continuing the analogy with R?, we are led to define the notion of the length or 
norm of a vector x by the formula 


Length of x = |x| = Vx- X = Vx? +x? +- x2. 


If x and y are two vectors in the plane (IR?) or in space (IR?), then we know that 
the angle 0 between them is given by the formula 


x-y 
lixll yl 


The right side of this equation can be defined in IR" as well as in R? or IR?. It still 
represents the cosine of the angle between x and y; this angle is geometrically well 
defined, because x and y lie in a two-dimensional subspace of R” (the plane determined 
by x and y) and our usual geometry ideas apply to such planes. 

It will be useful to have available some algebraic properties of the inner product. 
These are summarized in the next theorem [compare with properties (i), (ii), (iii), and 
(iv) of Section 1.2]. 


THEOREM 3 For x, y, z € R” anda, B, real numbers, we have 
(i) (ax + By)-z = o(x-2) + B(y-2). 

(ii) x-y=y-x. 

(iii) x-x > 0. 

(iv) x-x = 0 if and only if x = 0. 

PROOF Each of the four assertions can be proved by a simple computation. For 

example, to prove property (i) we write 

(ax + By) -z = (axi + Byi, a2 + By2, - - - Xn + BYn)* (Z1, Z2, - - -> Zn) 

= (axi + Byi)zi + (ax2 + By)za + +++ + (xs + BYn)Zn 
= axyzy  Byizi  oxaza + Byaza + +++ + AXnZ + BYnZn 
= a(x- z) + (y - z). 


The other proofs are similar. Wi 
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In Section 1.2, we proved an interesting property of dot products, called the 
Cauchy-Schwarz inequality.^ For R our proof required the use of the law of cosines. 
For R” we could also use this method, by confining our attention to a plane in IR". 
However, we can also give a direct, completely algebraic proof. 


THEOREM 4: Cauchy-Schwarz Inequality in R" Let x,y be 


vectors in IR". Then 


Ix* yl < Iixillyll- 


PROOF Leta = y-y and b = —x- y. If a = 0, the theorem is clearly valid, be- 
cause then y — 0 and both sides of the inequality reduce to 0. Thus, we may suppose 
a # 0. By Theorem 3 we have 


0 < (ax + by) - (ax + by) = a°x -x + 2abx-y +b°y -y 
= (y-y)x-x—(y-y)(x-y) 


Dividing by y -y gives 0 < (y-y)(x-x) — (x-y)’, that is, (x- y < (x-x)(y-y) = 
lx|7lly||?. Taking square roots on both sides of this inequality yields the desired 
result. m 


There is a useful consequence of the Cauchy-Schwarz inequality in terms of 
lengths. The triangle inequality is geometrically clear in IR? and was discussed in 
Section 1.2. The analytic proof of the triangle inequality that we gave in Section 1.2 
works exactly the same in IR" and proves the following: 


COROLLARY: Triangle Inequality in R” Let x, y be vectors in R”. Then 


lx 4- yll < lxii + Iyl. 


If Theorem 4 and its corollary are written out algebraically, they become the 


following useful inequalities: 

z P 27» 1/2 
SEDES (5x) (5x) : 
i=] i=l i=l 


r 1/2 ž 1/2 a 1/2 
"n e») E (5 s) $ (5x) . 
i=l i=l i=l 


4Sometimes called the Cauchy-Bunyakovskii-Schwarz inequality, or simply the CBS inequality, because it was indepen- 
dently discovered in special cases by the French mathematician Cauchy, the Russian mathematician Bunyakovskii, and 
the German mathematician Schwarz. 
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IC WZIBSSE Letx = (1,2, 0, — 1) andy = (—1, 1, 1, 0). Verify Theorem 4 and 


its corollary in this case. 


SOLUTION 


Ixl = V 4-220 c (713 = V6 
Iyl = VC? +P + 1? +0 = V3 
x-y = 1(—1)+2-1+0-1+(-1)0=1 
x+y = (; 3,1, =1) 
IIx +yl] = 02 +324 P + (-1)? = VII. 


We compute x «y = 1 < 4.24 ~ V6V3 = ||xl|l|yll, which verifies Theorem 4. Simi- 
larly, we can check its corollary: 


Ix + yl] = v11 ~ 3.32 
< 4.18 = 2.45 + 1.73 & V6-- 4/3 = Ixll + lly. à 
By analogy with IR?, we can define the notion of distance in R”; namely, if x and 
y are points in R”, the distance between x and y is defined to be ||x — y||, or the length 


of the vector x — y. We do not attempt to define the cross product on IR" except for 
n=3. 


General Matrices 


Generalizing 2 x 2 and 3 x 3 matrices (see Section 1.3), we can consider m x n 
matrices, which are arrays of mn numbers: 


au 412 ain 

a2, an 2n 
A= : 

Ami Am2 ^^^ Amn 


We shall also write A as [aj;]. We define addition and multiplication by a scalar 
componentwise, just as we did for vectors. Given two m x n matrices A and B, we 
can add them to obtain a new m x n matrix C = A + B, whose ijth entry cj; is the 
sum of a;; and b;;. It is clear that A + B = B+ A. 


EXAMPLE 2 


210 -1 1 3 12 7 
@) E 4 |+ [o 0 Jeg 4 3 
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©) [1 3+0 -g-rp m. 


w [1.em:, 
9 i3 aie 3] |a. af 

Given a scalar A and an m x n matrix A, we can multiply A by A to obtain a new 
m x n matrix AA = C, whose ijth entry cj; is the product Aa;;. 


EXAMPLE 3 


Next we turn to matrix multiplication. If A = [a;;], B = [bij] aren x n matrices, 
then the product AB = C has entries given by 


cy = X ainbs;, 
k=l 
which is the dot product of the ith row of A and the jth column of B: 


jth column 


ith row Fa] 


~ bnn 


EXAMPLE 4 Let 


103 0 1 
A=]2 10 and B-—[1 0 
100 0 1 
and "-l 


Then 


a 
to 
Il 
o=o 
-NA 
Oow 
pe 


Observe that AB # BA. A 
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Similarly, we can multiply an m x n matrix (m rows, n columns) by an n x p 
matrix (n rows, p columns) to obtain an m x p matrix (m rows, p columns) by the 
same rule. Note that for AB to be defined, the number of columns of A must equal 
the number of rows of B. 


EXAMPLE 5 Let 


Then 


and BA is not defined. A 


EXAMPLE Let 


1 
2 
A= 1 and B=(2 2 1 2] 
3 
Then 
22 1 2 
442 4 
AB= 2212 and BA-[13]. 4 
6 6 36 


Any m x n matrix A determines a mapping of R” to R” defined as follows: Let 
X = (xi, ..., Xn) € R”; consider the n x 1 column matrix associated with x, which 
we shall temporarily denote x" 


and multiply A by x^ (considered to be an n x 1 matrix) to get a new m x 1 matrix: 
an co Gin x yı 
Ax =| : E EEI i =r 


Ami `> mn Xn Ym 
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corresponding to the vector y = (y1, - - - , Ym).* Thus, although it may cause some 
confusion, we will write x = (x1, ..., Xn) and y = (yr, ..., Ym) as column matrices 
xı M 
x-|:|^ y=]: 
Xn Ym 


when dealing with matrix multiplication; that is, we will identify these two forms of 
writing vectors. Thus, we will delete the 7 on x” and view x^ and x as the same. 

Thus, Ax = y will “really” mean the following: Write x as a column matrix, 
multiply it by A, and let y be the vector whose components are those of the resulting 
column matrix. The rule x +> Ax therefore defines a mapping of IR" to IR". This 
mapping is linear; that is, it satisfies 


A(x +y) = Ax+ Ay 
A(ax) = a(Ax), æ a scalar, 
as may be easily verified. One learns in a linear algebra course that, conversely, any 
linear transformation of R” to R” is representable in this way by an m x n matrix. 
If A = [a;j] is an m x n matrix and e; is the jth standard basis vector of IR", 


then Ae; is a vector in R” with components the same as the jth column of A. That 
is, the ith component of Ae; is a;j. In symbols, (4e;); = aij. 


<AMPLE 7 iiid 


10 3 
-101 
4- 9 1 2) 
-1 2 1 


then x +> Ax of R? to IR* is the mapping defined by 


" xi + 3x3 
$ — cx 
na 2x, + x2 + 2x3 
—x| + 2x2 4- x3 
5To use a matrix A to get a mapping from vectors x = (xi. .... Xn) to vectors y = (yi, .... Yn) according to the equation 
Ax" = y", we write the vectors in the column form x^ instead of the row form (xi, ..., Xn). This sudden switch from 


writing x as a row to writing x as a column is necessitated by standard conventions on matrix multiplication, 
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DGVB] The following illustrates what happens to a specific point when 
mapped by a 4 x 3 matrix: 


4 2.9 0 2 

Ae) = Ae 1| 3 = 2nd column of A. A 
123 0 2 
0 1 2 1 


Properties of Matrices 


Matrix multiplication is not, in general, commutative: If A and B aren x n matrices, 
then generally 


AB £ BA, 


as Examples 4, 5, and 6 show. 
Ann x n matrix is said to be invertible if there is an n x n matrix B such that 


AB=BA=I],, 
where 
100 0 
0 10 0 
152|90 9 1 0 
000... 1 


is the n x n identity matrix: J, has the property that Z, C = CI, = C for any n x n 
matrix C. We denote B by 47! and call A~! the inverse of A. The inverse, when it 
exists, is unique. 


EXAMPLE 9 Bia 


240 ppa 4 
ASO 2 Ly; then ae 3 4 —2|, 
3 0 2 01.6 12 4 
because 447! = J; = A`! A, as may be checked by matrix multiplication. A 


Methods of computing inverses are learned in linear algebra; we won't require 
these methods in this book. If A is invertible, the equation Ax = y can be solved for 
the vector x by multiplying both sides by 47! to obtain x = A7y. 


5In fact, Cramer's rule from Section 1.3 provides one way to invert matrices. Numerically more efficient methods based 
on elimination methods are learned in linear algebra or computer science. 


82 


The Geometry of Euclidean Space 


In Section 1.3, we defined the determinant of a 3 x 3 matrix. This can be gen- 
eralized by induction to n x n determinants. We illustrate here how to write the 
determinant of a 4 x 4 matrix in terms of the determinants of 3 x 3 matrices: 


ai Gi dij 14 


an 023 024 anı 023 024 
421 422 023 a4 Smala a " ala ds? d 
—apj|d32 433 434) — 2 |a31 43 
431 d32 4033 34 
442 A43 a44 a4, A43 A44 
44, A42 A43 A44 
421 an 024 an 422 423 
+413 |431 432 434] — 4141431 432 433 
44, a 44 a4 ap 443 


[see formula (2) of Section 1.3; the signs alternate +, —, +, —]. 

The basic properties of 3 x 3 determinants reviewed in Section 1.3 remain valid 
forn x n determinants. In particular, we note the fact that if A is an n x n matrix and 
B is the matrix formed by adding a scalar multiple of one row (or column) of A to 
another row (or, respectively, column) of A, then the determinant of A is equal to the 
determinant of B (see Example 10). 

A basic theorem of linear algebra states that an n x n matrix A is invertible if and 
only if the determinant of A is not zero. Another basic property is that the determinant 
is multiplicative: det (AB) = (det A)(det B). In this text, we shall not make use of 
many details of linear algebra, and so we shall leave these assertions unproved. 


EXAMPLE 10 Let 


-N =- 
——-—o0 
oomme 
Nero 


Find det A. Does A have an inverse? 
SOLUTION Adding (—1) x first column to the third column, we get 


1 0 i : 0 
zu 2 

1 ] 1 1 

1 2 


Adding (—1) x first column to the third column of this 3 x 3 determinant gives 


—2 0 


det A =|1 “11 


I) =] 


1 00 
o= =- 
1 


Thus, det 4 = —2 #0, and so A has an inverse. A 
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If we have three matrices A, B, and C such that the products AB and BC are 
defined, then the products (4 B)C and A(BC) are defined and are in fact equal (that 
is, matrix multiplication is associative). We call this the triple product of matrices 
and denote it by ABC. 


SXAMPLE 11 Let 


4n [t]. B= 1] and ex [7]. 


Then 


ABC = A(BC) = 5] Bie pi A 


EXAMPLE 12 


c nail doles 


The founder of modern (coordinate) geometry was René Descartes (see 
Figure 1.5.1), a great physicist, philosopher, and mathematician, as well as a 
founder of modern biology. 

Born in Touraine, France, in 1596, Descartes had a fascinating life. After 
studying law, he settled in Paris, where he developed an interest in mathe- 
matics. In 1628, he moved to Holland, where he wrote his only mathematical 
work, La Geometrie, one of the origins of modern coordinate geometry. 

Descartes had been highly critical of the geometry of the ancient Greeks, 
with all their undefined terms and with their proofs requiring ever newer and 
more ingenious approaches. For Descartes, this geometry was so tied to 
geometrical figures "that it can exercise the understanding only on condition 
of greatly fatiguing the imagination." He undertook to exploit, in geometry, 
the use of algebra, which had recently been developed. The result was La 
Geometrie, which made possible analytic or computational methods in 
geometry. 

Remember that the Greeks were, like Descartes, philosophers as well as 
mathematicians and physicists. Their answer to the question of the meaning 
of space was "Euclidean geometry." Descartes had therefore succeeded in 
"algebrizing"the Greek model of space. 

Gottfried Wilhelm Leibniz, cofounder (with Isaac Newton) of calculus, 
was also interested in "space analysis," but he did not think that Descartes" 
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Figure 1.5.1 René Descartes 
(1596-1650). 


algebra went far enough. Leibniz called for a direct method of space analysis 
(analysis situs) that could be interpreted as a call for the development of 
vector analysis. 

On September 8, 1679, Leibniz outlined his ideas in a letter to Christian 
Huygens: 


Tam still not satisfied with algebra, because it does not give the 
shortest methods or the most beautiful constructions in geometry. 
This is why I believe that, so far as geometry is concerned, we need 
still another analysis which is distinctly geometrical or linear and 
which will express situation (situs) directly as algebra expresses 
magnitude directly. And I believe that I have found the way and that 
we can represent figures and even machines and movements by 
characters, as algebra represents numbers or magnitudes. I am 
sending you an essay which seems to me to be important. 


In the essay, Leibniz described his ideas in greater detail: 


Ihave discovered certain elements of a new characteristic which is 
entirely different from algebra and which will have great advantages 
in representing to the mind, exactly and in a way faithful to its 
nature, even without figures, everything which depends on sense 
perception. Algebra is the characteristic for undetermined numbers 
of magnitudes only, but it does not express situation, angles, and 
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motion directly. Hence it is often difficult to analyze the properties 
of a figure by calculation, and still more difficult to find very 
convenient geometrical demonstrations and constructions, even 
when the algebraic calculation is completed. But this new 
characteristic, which follows the visual figures, cannot fail to give 
the solution, the construction, and the geometric demonstration all 
at the same time, and in a natural way and in one analysis, that is, 
through determined procedure. 


Leibniz'sideas influenced Hamilton and others. In the middle of the 
nineteenth century, Bolyai and Lobachevsky developed their “non-Euclidean” 
geometry, and Gauss studied and developed a theory of curved surfaces 
in three-dimensional space. Gauss developed two measures of curvature, 
the mean curvature and the Gauss curvature. For example, soap bubbles 
and soap films have constant mean curvature, whereas only soap bubbles 
have constant Gauss curvature. We discuss these ideas further in Section 7.7. 

Bernhard Riemann (see Figure 1.5.2), possibly the greatest 
mathematical genius of all time, gave an inaugural address in 1854 before the 
faculty of Góttingen University, entitled "On the Hypotheses Which Lie at 
the Foundation of Geometry." It was this monumental work that would lay 
the foundation, 50 years later, of Einstein'sgeneral theory of relativity. 
Riemann, like Leibniz and the early Greeks, was interested in space, 
especially its metric (or distance) properties. 


Figure 1.5.2 Bernhard Riemann 
(1826-1866). 
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Riemann called for the study of n-dimensional spaces and surfaces. He 
showed how to measure the curvature of three-, four-, ..., n-dimensional 
surfaces and (incredibly) showed that in order to be called “curved,” a 
surface need not be “curving” inside anything else; curvature was simply a 
consequence of the intrinsic “metric properties of space.” Once Riemann 
demonstrated that mathematical models permitted us to think of spaces 
of any dimension, the question naturally arose as to why our space is 
three-dimensional and not four-, five-, or more dimensional. Surprisingly, * 
noone has yet put forth a convincing explanation why, at the moment of 
creation, space became three-dimensional. 

Around 1910, Albert Einstein realized that gravity could be explained as 
a consequence of the curvature of a four-dimensional space-time (matter and 
energy curve space and time), and, thanks to Riemann, Einstein'sspace-time 
need not be enclosed in an ambient universe. Exactly how matter and energy 
curve space-time is the essence of Einstein'sfield equations in general 
relativity. In Section 7.7, we will discuss the ideas of curvature in greater 
depth and will indicate some of the ideas behind general relativity. The idea 
of n-dimensions also began to creep into mathematics from another, very 
different direction—from matrices. 

The definition of a matrix, as an isolated abstract object, is due to the 
English mathematician Arthur Cayley. Cayley was born in 1821, and in 1863 
was appointed Sedlesian Professor of Mathematics at Cambridge University. 
Around 1855, one year after Riemann'sinaugural address, Cayley, in an 
effort to simplify notation for his study of linear equations (as we saw in 
Section 1.5), introduced the abstract idea of a matrix of m columns and n 
rows. Naturally, a 1 x n matrix could also be viewed as a vector in an 
"n-dimensional space." 

After this concept took hold, mathematicians realized that they lost 
little in working in general dimensions, and the subject of modern linear 
algebra was off and running. Again, physics was to be a major impetus. 
Modern, abstract, linear algebra, including abstract vector spaces, began to 
turn up in textbooks after the appearance in 1918 of Hermann Weyl's 
Space-Time-Matter. 


EXERCISES 
1. Calculate the dot product of x = (1, —1, 0, 2) € R^ and y=(1,2,3,4 € R^. 
2. In R" show that 


(a) 2IIxI? + 2Ilyll? = lix + yll? + lix — yll? (This is known as the parallelogram law.) 


(6) Ix — yIlllx + yl < Ixl? + liy I 
(c) 4(x, y) = [lx + yll? — lx — yl? (This is called the polarization identity.) 


Interpret these results geometrically in terms of the parallelogram formed by x and y. 
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Verify the Cauchy-Schwarz inequality and the triangle inequality for the vectors in Exercises 
3 to 6. 


3. x = (2,0, —1), y = (4,0, —2) 
4. x =(1, 0, 2, 6), y = (3, 8, 4, 1) 


5. x- (Ll, CL, 1, 21, 1), y = (3, 0, 0, 0, 2) 


e 


x —(1,0,0, 1), y = (—1, 0, 0, 1) 


E 


Compute AB, det A, det B, det (AB), and det (A + B) for 


1 -1 0 —2 0 2 
A=|0 3 2 and B=|-1 1 -I1J. 
3 1 1 14 3 


Compute AB, det A, det B, det (AB), and det (A + B) for 


3 0 1 1 0 =I 
4z|12 -1 and B=|2 0 1 
10 1 


1 0 
9. Use induction on k to prove that if x;,..., x, € R”, then 
lxi - +++ xl] S Mill +--+ xl 
10. Prove using algebra, the identity of Lagrange: For real numbers x), . . . , Xn and 


yis««ssYn- 


5»! = (Ex) (5x) -Zer -z 


i<j 
Use this to give another proof of the Cauchy-Schwarz inequality in IR", 


11. Prove that if A is an n x n matrix, then 


(a) det (AA) = A" det A; and 
(b) if B is a matrix obtained from A by multiplying any row or column by a scalar A, 
then det B = A det A. 


In Exercises 12 to 14, A, B, and C denote n x n matrices. 
12. Is det (A + B) = det A + det B? Give a proof or counterexample. 
13. Does (4 + B)(A — B) = A? — B? 


14. Assuming the law det (4B) = (det A)(det B), prove that det (ABC) = 
(det A)(det B)(det C). 
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15. (This exercise assumes a knowledge of integration of continuous functions of one 
variable.) Note that the proof of the Cauchy-Schwarz inequality (Theorem 4) depends only 
on the properties of the inner product listed in Theorem 1. Use this observation to establish 
the following inequality for continuous functions f, g: [0, 1] > R: 


< i f OP dx | [ ‘eR dx. 


(a) verifying that the space of continuous functions from [0, 1] to R forms a vector 
space; that is, we may think of functions /, g abstractly as “vectors” that can be added to each 
other and multiplied by scalars. 

(b) introducing the inner product of functions 


1 
| [ SOE) dx 


Do this by 


1 
fia [| feeds 
0 
and verifying that it satisfies conditions (i) to (iv) of Theorem 3. 
16. Define the transpose A’ of ann x n matrix A as follows: the ijth element of A” is aj 


where a;; is the ijth entry of A. Show that AT is characterized by the following property: For 
all x, y in R”, 


(A x): y = x (Ay). 
17. Verify that the inverse of 
a b is l d —b 
c d ad —bc|-c af 


18. Use your answer in Exercise 17 to show that the solution of the system 


ax +by =e 
cx+dy=f 


x] | ! d —bl[e 

y| ad—bc|-c all fy 
19. Assuming the law det (AB) = (det A)(det B), verify that (det 4)(det 47!) = 1 and 
conclude that if A has an inverse, then det A 4 0. 


REVIEW EXERCISES FOR CHAPTER 1 


1. Let v = 3i 4- 4j - 5k and w = i — j + k. Compute v + w, 3v, 6v + 8w, —2v, v- w, 
v x w. Interpret each operation geometrically by graphing the vectors. 


2. Repeat Exercise 1 with v = 2j + k and w = —i — k. 
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3. (a) Find the equation of the line through (— 1, 2, —1) in the direction of j. 
(b) Find the equation of the line passing through (0, 2, — 1) and (—3, 1, 0). 
(c) Find the equation for the plane perpendicular to the vector (—2, 1, 2) and passing 
through the point (—1, 1, 3). 


4. (a) Find the equation of the line through (0, 1, 0) in the direction of 3i 4- k. 
(b) Find the equation of the line passing through (0, 1, 1) and (0, 1, 0). 
(c) Find an equation for the plane perpendicular to the vector (—1, 1, — 1) and passing 
through the point (1, 1, 1). 
5. Compute v - w for the following sets of vectors: 


(a) v= —i +j;w =k. 
(b) v=i+2j —k;w=3i+j. 
(c) v= —2i—j+k;w = 3i + 2j — 2k. 
6. Compute v x w for the vectors in Exercise 5. [Only part (b) is solved in the Study Guide.] 


7. Find the cosine of the angle between the vectors in Exercise 5. [Only part (b) is solved in 
the Study Guide.] 


8. Find the area of the parallelogram spanned by the vectors in Exercise 5. [Only part (b) is 
solved in the Study Guide.] 


9. Use vector notation to describe the triangle in space whose vertices are the origin and the 
endpoints of vectors a and b. 


10. Show that three vectors a, b, c lie in the same plane through the origin if and only if there 
are three scalars a, B, y, not all zero, such that ~a + Bb + yc = 0. 


11. For real numbers aj, a2, a3, bi, b2, b3, show that 
(aibi + agb; + asby. < (ai +a} -- abi + b3 + b3). 


12. Letu, v, w be unit vectors that are orthogonal to each other. If a = au + £v + yw, show 
that 


o —a-u, pB-a:v, y=a-w. 
Interpret the results geometrically. 


13. Let a, b be two vectors in the plane, a = (a), a2). b = (51, b2), and let À be a real 
number. Show that the area of the parallelogram determined by a and b + Aa is the same as 
that determined by a and b. Sketch. Relate this result to a known property of determinants. 


14. Find the volume of the parallelepiped determined by the vertices (0, 1, 0), (1, 1, 1), 
(0, 2, 0), (3, 1, 2). 


15. Given nonzero vectors a and b in IR?, show that the vector v = [[a||b + ||b|[a bisects the 
angle between a and b. 
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16. Use vector methods to prove that the distance from the point (x, yı) to the line 
ax 4 by-cis 
laxi + by: — cl 
lab ~ 


17. Verify that the direction of b x c is given by the right-hand rule, by choosing b, c to be 
two of the vectors i, j, and k. 


18. (a) Suppose a- b = a'- b for all b. Show that a = a’. 
(b) Suppose a x b = a’ x b for all b. Is it true that a = a’? 


19. (a) Using vector methods, show that the distance between two nonparallel lines /, and /; 
is given by 
dz Kv? — vi) - (ài x a2)| 

lai x a2|| 


where v;, V2 are any two points on /; and /;, respectively, and a, and a; are the directions of /; 
and l2. [Hint: Consider the plane through /; that is parallel to /;. Show that the vector 
(aj x a2)/||ai x a2|| is a unit normal for this plane; now project v2 — v; onto this normal 
direction.] 

(b) Find the distance between the line /; determined by the points (—1, —1, 1) and 
(0, 0, 0) and the line /; determined by the points (0, —2, 0) and (2, 0, 5). 


20. Show that two planes given by the equations Ax + By + Cz + D, = 0 and 
Ax + By + Cz + D; = O are parallel, and that the distance between them is 
|D; — Dj 
[Az + B+C 
21. (a) Prove that the area of the triangle in the plane with vertices (xi, yi), (xo. 2), (x3, y3) 
is the absolute value of 


L rf d 
zia X2 xj. 
M» Je 5 


(b) Find the area of the triangle with vertices (1, 2), (0, 1), (—1, 1). 


22. Convert the following points from Cartesian to cylindrical and spherical coordinates and 
plot: 


(a) (0,3, 4) (d) (—1, 0, 1) 
(b) (—V2, 1,0) (e) (-2V3, —2, 3) 
(c) (0, 0, 0) 


23. Convert the following points from cylindrical to Cartesian and spherical coordinates and 
plot: 


(a) (1, 77/4, 1) (b) (3, 2/6, —4) 
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(c) (0, 77/4, 1) (e) (—2, —/2, 1) 
(d) (2, —7/2, 1) 


24. Convert the following points from spherical to Cartesian and cylindrical coordinates and 
plot: 


(a) (1, 2/2, x) (d) (2, —7/2, —1) 
(b) (2, —7/2, 1/6) (e) (-1, x, 2/6) 
(c) (0, 7/8, 2/35) 
25. Rewrite the equation z = x? — y? using cylindrical and spherical coordinates. 
26. Using spherical coordinates, show that 
u-k 
$ = cos! (5) 
llull 
where u = xi + yj + zk. Interpret geometrically. 
27. Verify the Cauchy-Schwarz and triangle inequalities for 


x=(3,2,1,0) and  y-(L1,1,2) 


28. Multiply the matrices 


Does AB — BA? 
29. (a) Show that for two n x n matrices A and B, and x € R”, 
(AB)x — A(Bx). 


(b) What does the equality in part (a) imply about the relationship between the 
composition of the mappings x +> Bx, y +> Ay, and matrix multiplication? 


30. Find the volume of the parallelepiped spanned by the vectors 
(1,0,1). (1,1, D, and (—3,2, 0). 


31. (For students with some knowledge of linear algebra.) Verify that a linear mapping T of 
IR" to IR" is determined by an n x n matrix. 


32. Find an equation for the plane that contains (3, — 1, 2) and the line with equation 
v = (2, 21,0) + ¢(2, 3, 0). 


33. The work W done in moving an object from (0, 0) to (7, 2) subject to a constant force F 
is W = F-r, where r is the vector with its head at (7, 2) and tail at (0, 0). The units are feet 
and pounds. 
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(a) Suppose the force F = 10 cos 6i + 10 sin 0j. Find W in terms of 0. 
(b) Suppose the force F has magnitude of 6 Ib and makes an angle of 7/6 rad with the 
horizontal, pointing right. Find W in foot-pounds. 


34. Ifa particle with mass m moves with velocity v, its momentum is p = mv. In a game of 
marbles, a marble with mass 2 grams (g) is shot with velocity 2 meters per second (m/s), hits 
two marbles with mass 1 g each, and comes to a dead halt. One of the marbles flies off with 
a velocity of 3 m/s at an angle of 45* to the incident direction of the larger marble as in 
Figure 1.R.1. Assuming that the total momentum before and after the collision is the same 
(according to the law of conservation of momentum), at what angle and speed does the 
second marble move? 


36. Show that 


NO 
* 
i 
th 
e 


if x, y, and z are all different. 
37. Show that 


66 628 246 68 627 247 
88 435 .24|—|86 436 23. 
2 =l l 2 =l 1 


38. Show that 
n nl n+2 
n+3 n+4 n+5 
n+6 n+7 n+8 


has the same value no matter what n is. What is this value? 
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39. The volume of a tetrahedron with concurrent edges a, b, c is given by V = ja * (b x €). 


(a) Express the volume as a determinant. 
(b) Evaluate V when a =i+j+k,b=i—j+k,c=i+j. 


Use the following definition for Exercises 40 and 41: Let vi, . .. , v, be vectors in R? from 0 to 


the masses my, ..., Mn. The center of mass is the vector 
rei HE 
izl i*i 
E n 
Mim 


40. A tetrahedron sits in xyz coordinates with one vertex at (0, 0, 0), and the three edges 
concurrent at (0, 0, 0) are coincident with the vectors a, b, c. 


(a) Draw a figure and label the heads of the vectors a, b, c. 
(b) Find the center of mass of each of the four triangular faces of the tetrahedron if a 
unit mass is placed at each vertex. 


41. Show that for any vector r, the center of mass of a system satisfies 


" à 
3 mille ril? = Y mir — cl? + mlir — elt. 
i=l i=l 


where m = Y 5; , m; is the total mass of the system. 

In Exercises 42 to 47, find a unit vector that has the given property. 

42. Parallel to the line x = 3t + 1, y = 16t — 2, z = —(t + 2). 

43. Orthogonal to the plane x — 6y + z = 12. 

44. Parallel to both the planes 8x + y +z = 1 and x —y—z=0. 

45. Orthogonal to i+ 2j — k and to k. 

46. Orthogonal to the line x = 2t — 1, y = —t — l,z = t + 2, and the vector i — j. 


47. At an angle of 30° to i and making equal angles with j and k. 


e 
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I turn away with fright and horror from the lamentable evil of functions 
which do not have derivatives. 


Charles, Hermite, 


in a letter to Thomas Jan Stieltjes 


his chapter extends the principles of differential calculus for functions of one 

variable to functions of several variables. We begin in Section 2.1 with the ge- 
ometry of real-valued functions and study the graphs of these functions as an aid 
in visualizing them. Section 2.2 gives some basic definitions relating to limits and 
continuity. This subject is treated briefly, because it requires time and mathematical 
maturity to develop fully and is therefore best left to a more advanced course. For- 
tunately, a complete understanding of all the subtleties of the limit concept is not 
necessary for our purposes; the student who has difficulty with this section should 
bear this in mind. However, we hasten to add that the notion of a limit is central to 
the definition of the derivative, but not to the computation of most derivatives in spe- 
cific problems, as we already know from one-variable calculus. Sections 2.3 and 2.5 
deal with the definition of the derivative, and establish some basic rules of calculus: 
namely, how to differentiate a sum, product, quotient, or composition. In Section 2.6, 
we study directional derivatives and tangent planes, relating these ideas to those in 
Section 2.1. Finally, the Internet supplement gives some of the technical proofs. 

In generalizing calculus from one dimension to several, it is often convenient 
to use the language of matrix algebra. What we shall need has been summarized in 
Section 1.5. 


2.1 The Geometry of Real-Valued Functions 


We launch our investigation of real-valued functions by developing methods for vi- 
sualizing them. In particular, we introduce the notions of a graph, a level curve, and 
a level surface of such functions. 
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Functions and Mappings 


Let f be a function whose domain is a subset A of IR" and with a range contained 
in R”. By this we mean that to each x = (xi, ..., Xn) € A, f assigns a value f(x), 
an m-tuple in R”. Such functions f are called vector-valued functions! if m > 1, 
and scalar-valued functions if m — 1. For example, the scalar-valued function 
fŒ, y, z) = (x? + y? + z2) 9"? maps the set A of (x, y, z) # (0, 0, 0) in R? (n —3 
in this case) to R (m = 1). To denote f we sometimes write 


fi wnae Q y) 4277, 


Note that in IR? we often use the notation (x, y, z) instead of (x1, x», x3). In general, 
the notation x +> f(x) is useful for indicating the value to which a point x € R” is 
sent. We write f: A C R” — R” to signify that A is the domain of f(a subset of IR") 
and the range is contained in R". We also use the expression f maps A into IR". Such 
functions f are called functions of several variables if A C R",n > 1. 

As another example we can take the vector-valued function g: IRÉ — IR? defined 
by the rule 


g(x) = g(x1, X2, X3, X4, X5, x6) = (xixaxaxaxsxs, vx? +32). 


The first coordinate of the value of g at x is the product of the coordinates of x. 
Functions from IR" to R” are not just mathematical abstractions, they arise natu- 
rally in problems studied in all the sciences. For example, to specify the temperature 
T in a region A of space requires a function T: A C R? > R (n = 3, m = 1); thus, 
T(x, y, z) is the temperature at the point (x, y, z). To specify the velocity of a fluid 
moving in space requires a map V: R^ — IR?, where V(x, y, z, t) is the velocity 
vector of the fluid at the point (x, y, z) in space at time ¢ (see Figure 2.1.1). To 


Figure 2.1.1 A fluid in motion defines a 
vector field V by specifying the velocity of 
the fluid particles at each point in space 
and time. 


! Some mathematicians would write such an f in boldface, using the notation f(x), because the function is vector-valued. 
We did not do so, as a matter of personal taste. We use boldface primarily for mappings that are vector fields, introduced 
later. The noticn of function was developed over many centuries, with the definition extended to cover more cases as 
they arose. For example, in 1667 James Gregory defined a function as “a quantity obtained from other quantities by a 
succession of algebraic operations or by any other operation imaginable.” In 1755 Euler gave the following definition: 
“If some quantities depend on others in such a way as to undergo variation when the latter are varied then the former are 
called functions of the latter.” 
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specify the reaction rate of a solution consisting of six reacting chemicals 
A, B, C, D, E, F in proportions x, y, z, w, u, v requires a map o: U C RÉ > R, 
where c (x, y, Z, w, u, v) gives the rate when the chemicals are in the indicated pro- 
portions. To specify the cardiac vector (the vector giving the magnitude and di- 
rection of electric current flow in the heart) at time ¢ requires a map c: R > R?, 
t e e(t). 

When f: U c R" — R, we say that f is a real-valued function of n variables 
with domain U. The reason we say "n variables" is simply that we regard the co- 
ordinates of a point x = (x1, ..., Xn) € U as n variables, and f(x) = f(xi,..., Xn) 
depends on these variables. We say “real-valued” because f(xi,...,x,) is a real 
number. A good deal of our work will be with real-valued functions, so we give them 
special attention. 


Graphs of Functions 


For f: U C R 5 R (n = 1), the graph of f is the subset of R? consisting of all 
points (x, f(x)) in the plane, for x in U. This subset can be thought of as a curve in 
R?. In symbols, we write this as 


graph f = (Gr, f(x)) € R? | x € UJ}, 
where the curly braces mean “the set of all” and the vertical bar is read “such that.” 


Drawing the graph of a function of one variable is a useful device to help visualize 
how the function actually behaves. (See Figure 2.1.2.) It will be helpful to generalize 


Graph of f 


Graph of f 


(a) 


Figure 2.1.2 The graphs of (a) a function of one variable, and (b) a function of two 
variables. 
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the idea of a graph to functions of several variables. This leads to the following 
definition: 


DEFINITION: Graph of a Function Let f: U C IR" — R. Define the 
graph of f to be the subset of R”*! consisting of all the points 


Qa, <--> Xn fŒ, - - x8) 


in R"*! for (xi, ..., Xn) in U. In symbols, 


graph f = (Qa, -- -s Xn, fx, -s Xn)) € R"*! | xi, Xn) € U). 


For the case n = 1 the graph is a curve in R?, while for n = 2 it is a surface in 
R? (see Figure 2.1.2). For n = 3 it is difficult to visualize the graph, because, since 
we are humans living in a three-dimensional world, it is hard for us to envisage sets 
in R^. To help overcome this handicap, we introduce the idea of a level set. 


Level Sets, Curves, and Surfaces 


Suppose f(x, y, z) — x? + y? -- z?. A level set is a subset of R? on which f is 
constant; for instance, the set where x? + y? + z? = 1 is a level set for f. This we can 
visualize: It is just a sphere of radius 1 in R?. Formally, a level set is the set of (x, y, z) 
such that f(x, y, z) = c, where c is a constant. The behavior or structure of a function 
is determined in part by the shape of its level sets; consequently, understanding these 
sets aids us in understanding the function in question. Level sets are also useful for 
understanding functions of two variables f(x, y), in which case we speak of level 
curves or level contours. 

The idea is similar to that used to prepare contour maps, where one draws lines 
to represent constant altitudes; walking along such a line would mean walking on a 
level path. In the case of a hill rising from the xy plane, a graph of all the level curves 
gives us a good idea of the function A(x, y), which represents the height of the hill at 
point (x, y) (see Figure 2.1.3). 


Figure 2.1.3 Level contours of a function are defined in the same manner as 
contour lines for a topographical map. 
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DEISE The constant function f: R? > R, (x, y) + 2, that is, the func- 
tion f(x, y) = 2, has as its graph the horizontal plane z = 2 in IR?. The level curve 
of value c is empty if c # 2, and is the whole xy plane ifc —2. A 


The function f: R? — R, defined by f(x, y) =x + y + 2 has as 
its graph the inclined plane z = x + y + 2. This plane intersects the xy plane (z = 0) 
in the line y = —x — 2 and the z axis at the point (0, 0, 2). For any value c € R, the 
level curve of value c is the straight line y = —x + (c — 2); or in symbols, the set 


Le = {(x, y) | y = —x + (c — 2)} CR’. 


We indicate a few of the level curves of the function in Figure 2.1.4. This is a contour 
map of the function f. 


Figure 2.1.4 The level curves of f(x, y) 2x +y +2 
show the sets on which f takes a given value. 


Line of 
intersection of 
planez=x+y+2 
and the xy plane 


From level curves labeled with the value or “height” of the function, the shape of 
the graph may be inferred by mentally elevating each level curve to the appropriate 
height, without stretching, tilting, or sliding it. If this procedure is visualized for all 
level curves, Le, that is, for all values c € R, they will assemble to give the entire 
graph of f, as indicated by the shaded plane in Figure 2.1.5. If the graph is visualized 


Level curve lifted 
to surface 


Level curve lifted 
to surface 


Figure 2.1.5 The relationship 
Level curvex+y+2=0 of level curves of Figure 2.1.4 
P< to the graph of the function 
77 x+y42=2 in xy plane S(x,y) =x +y +2, which is 
E y the plane z = x + y 4- 2. 
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using a finite number of level curves, a contour model is produced. If f is a smooth 
function, its graph will be a smooth surface, and so the contour model, mentally 
smoothed over, gives a good impression of the graph. A 


DEFINITION: Level Curves and Surfaces Let f: U C R"— R and 
let c € R. Then the /evel set of value c is defined to be the set of those points 
x € U at which f(x) — c. If n — 2, we speak of a level curve (of value c); 
and if n — 3, we speak of a level surface. In symbols, the level set of value 
c is written 


{xeU| f()ac) CR’. 


Note that the level set is always in the domain space. 


DISH 5) Describe the graph of the quadratic function 
f: >R, œ, y) x+y. 


SOLUTION The graph is the paraboloid of revolution z = x? + y?, oriented 
upward from the origin, around the z axis. The level curve of value c is empty for 
c < 0; for c > 0 the level curve of value c is the set ((x, y) | x? + y? = c}, a circle 
of radius 4/c centered at the origin. Thus, raised to height c above the xy plane, the 
level set is a circle of radius ./c, indicating a parabolic shape (see Figures 2.1.6 and 
2.1.7). A 


Figure 2.1.6 Some level curves for the function 
Sæ) =x +y. 


=32 
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x2 + y2= 42 


Figure 2.1.7 Level curves in Figure 2.1.6 raised to the 
graph. 


The Method of Sections 


By a section of the graph of f we mean the intersection of the graph and a (vertical) 
plane. For example, if P, is the xz plane in R3, defined by y = 0, then the section of 
f in Example 3 is the set 


Pi N graph f = (6, y, z) |y 20,22 x°}, 


which is a parabola in the xz plane. Similarly, if P; denotes the yz plane, defined by 
x = 0, then the section 


P,0 graph f = {(x, y,z) |x =0,z =y} 


is a parabola in the yz plane (see Figure 2.1.8). It is usually helpful to compute at 
least one section to complement the information given by the level sets. 


^WIIUEE] The graph of the quadratic function 
f:R >R, (x, y)exi-y 


is called a hyperbolic paraboloid, or saddle, centered at the origin. Sketch the graph. 


SOLUTION To visualize this surface, we first draw the level curves. To de- 
termine the level curves, we solve the equation x? — y? — c. Consider the values 
c = 0, +1, +4. For c = 0, we have y? = x?, or y = +x, so that this level set consists 
of two straight lines through the origin. For c = 1, the level curve is x? — y? = 1, or 
y = ++/x? — 1, which is a hyperbola that passes vertically through the x axis at the 
points (+1, 0) (see Figure 2.1.9). Similarly, for c = 4, the level curve is defined by 
y = xx? — 4, the hyperbola passing vertically through the x axis at (+2, 0). For 


2.1 The Geometry of Real-Valued Functions 101 


$,::z723,y-0 


$;::z-y2,x-0 
Figure 2.1.8 Two sections of the graph 
of f(x, y) 2 x* + y?. 


c = —1, we obtain the curve x? — y? = —1, that is, x = +,/y? — 1, the hyperbola 
passing horizontally through the y axis at (0, +1). And for c = —4, the hyperbola 
through (0, +2) is obtained. These level curves are shown in Figure 2.1.9. Because it 
is not easy to visualize the graph of f from these data alone, we shall compute two 
sections, as in the previous example. For the section in the xz plane, we have 


P, N graph of f = {(x, y, z) | y 20,z = x°}, 
which is a parabola opening upward; and for the yz plane, 
P.M graph f = {(x, y, z) | x = 0,z = —y?), 


which is a parabola opening downward. The graph may now be visualized by lifting the 
level curves to the appropriate heights and smoothing out the resulting surface. Their 


Figure 2.1.9 Level curves for the function 
fœ y=- y. 
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placement is aided by computing the parabolic sections. This procedure generates 
the hyperbolic saddle indicated in Figure 2.1.10. Compare this with the computer- 
generated graphs in Figure 2.1.11 (note that the orientation of the axes has been 
changed). A 


x-y =1};z=1 


Figure 2.1.10 Some level curves on the 
RAPER graph of f(x, y) =x? — y. 


y axis 


Tags -=l 


x axis 


Figure 2.1.11 The graph of z = x? — y? and its level curves. 


DAAA] Describe the level sets of the function 


f: R —RQyngexseyexz 


SOLUTION This is the three-dimensional analogue of Example 3. In this context, 
level sets are surfaces in the three-dimensional domain IR?. The graph, in IR^, cannot 
be visualized directly, but sections can nevertheless be computed. 

The level set with value c is the set 


Le = (yz) x? +y? +2 =c), 
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which is the sphere centered at the origin with radius ./c for c > 0, is a single point 
at the origin for c — 0, and is empty for c « 0. The level sets for c — 0, 1, 4, and 9 
are indicated in Figure 2.1.12. A 


x+y +=]? 


xsxy2-( 


x+y242=72 


go tt eae 
Figure 2.1.12 Some level surfaces for 
fe, y z) =x +y 427. 


DVi BA] Describe the graph of the function f: R? > R defined by 
f(x, y, Z) =x? + y? — z^, which is the three-dimensional analogue of Example 4, 
and is also called a saddle. 

SOLUTION Formally, the graph of f is a subset of four-dimensional space. If we 
denote points in this space by (x, y, z, t), then the graph is given by 


(yz trm y! -z). 
The level surfaces of f are defined by 
Le = (x,y,z) |x? +y -2 =e}. 


For c = 0, this is the cone z = +,/x? + y? centered on the z axis. For c negative, 
say, c = —a?, we obtain z = +,/x? + y? + a?, which is a hyperboloid of two sheets 
around the z axis, passing through the z axis at the points (0, 0, +a). For c positive, 
say, c = b?, the level surface is the single-sheeted hyperboloid of revolution around 
the z axis defined by z = -++,/x? + y? — b?, which intersects the x y plane in the circle 
of radius |b|. These level surfaces are sketched in Figure 2.1.13. 

Another view of the graph may be obtained from a section. For example, the 
subspace Sy = {(x, y, zZ, t) | y = 0} intersects the graph in the section 


Sy-o N graph f = {(x, y, z, t) | y 20, t2 x^ — 2°}, 


that is, the set of points of the form (x, 0, z, x? — z?), which may be considered to be 
a surface in xzt space (see Figure 2.1.14). A 
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Figure 2.1.13 Some level surfaces of the 
function f(x, y, z) =x? + y? - z. 


gure 2.1.14 The y — 0 section 
the graph of f(x, y, z) — 
+y -z. 


We have seen how the methods of sections and level sets can be used to understand 
the behavior of a function and its graph; these techniques can be quite useful to 
people who desire comprehensive visualization of complicated data. There are many 
computer programs available to do this, and we show the results of one such program 
in Figure 2.1.15. 
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yaxis 0 


Figure 2.1.15 Computer-generated graph of z = (x? + 3?) exp(1 — x? — y?) 
represented in three ways: (a) by sections, (b) by level curves on a graph, and 
(c) by level curves in the xy plane. 


EXERCISES 
1. Sketch the level curves and graphs of the following functions: 
(a) f: R —> R, (x, y) x -y 2 (c) f: R >R, (x, y) 5 —xy 
(b) f: R? > R, (x, y) x? + 4y? 
2. Describe the behavior, as c varies, of the level curve f(x, y) — c for each of these 
functions: 
() fæ y =x +y +l b) fey-l-x-» (o)fQy-s-x 


3. For the functions in Examples 2, 3, and 4, compute the section of the graph defined by 
the plane 


So = (Gr, y, z) | y = x tan} 


for a given constant 0. Do this by expressing z as a function of r, where x = r cos6, y = 
r sin 0. Determine which of these functions f have the property that the shape of the section 
S, graph f is independent of 6. (The solution for Example 3 only is in the Study Guide.) 
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In Exercises 4 to 10, draw the level curves (in the xy plane) for the given function f and 
specified values of c. Sketch the graph of z — f(x, y). 


4. fQy)- 


4—3x^F2y, c0, 1,2, 3, —1, —2, 23 


5. f(x, y) = (100 — x? — y2)2, c = 0,2,4,6,8,10 


6. fi. y)-— 


(x? 4-2), c =0, 1, 2,3,4,5 


T. S(x,y) =x? -yc20,1,2,3,4,5 


8. f(x,y) = 


3x — Ty, ¢ =0, 1, 2,3, 21, 22, -3 


9. f(x, y) =x? +xy,c=0, 1,2, 3, -1, 22, -3 


10. f(x, y) 


x[y, c —0,1,2,3, 1, 2, 3 


In Exercises 11 to 13, sketch or describe the level surfaces and a section of the graph of each 


function. 
11. f: R —> 
12. f: B= 


13. f: R => 


R, (yz) —à-y-z 
R, (x, y, z) œ> 4x? + y? + 927 


R, (x,y,z) x? + y? 


In Exercises 14 to 18, describe the graph of each function by computing some level sets and 


sections. 

M. f: R > 
15. f: R > 
16. f: R? > 


17. f: R > 


R, (x, y, z) > xy 
R, (x, y, z) =œ xy + yz 


R, (x, y, Z) > xy +27 


R, (x, y) e» Iyl 


18. f: R? > R, (x, y) e^ max (Ixl, Iyl) 


Sketch or describe the surfaces in R? of the equations presented in Exercises 19 to 31. 


19. 4x? + y? = 16 20. x - 22-4 
21. 2-2 y? +4 22. x! y - 2x 20 
2 2 2 2 2 
x y z y Z x 
23. — — +> 24. —+—=14+— 
474*5 yg tie 


26. y? +2? =4 
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2 2 
y x Ses doo 
27. z= — - — 28. -x*-4z 
774 779 4 
x? y? z 
.4x?—3y? 2- 30. —+—+—=1 
29. 4x? —3y* 4-27 20 9*12*5 


31. x? +y? +27 + Ax — by + 9z — b = 0, where b is a constant 
32. Using polar coordinates, describe the level curves of the function defined by 
S(x,y) = 2xy[G* +3°) if (x, y) # (0, 0) and /(0,0) = 0. 


33. Let f: R?\{0} — R be given in polar coordinates by f(r, 0) = (cos 20)/r?. Sketch a 
few level curves in the xy plane. Here, R?\{0} = {x € R? | x 4 0}. 


34. Show that in Figure 2.1.15, the level "curve" z = 3 consists of two points. 
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Open Sets 


This section develops the concepts of open sets, limits, and continuity; open sets are 
needed to understand limits, and limits are in turn needed to understand continuity 
and differentiability. 

As in elementary calculus, it is not necessary to completely master the limit 
concept in order to work problems in differentiation. For this reason, instructors may 
treat the following material with varying degrees of rigor. The student should consult 
with the instructor about the depth of understanding required. 


We begin formulating the concept of an open set by defining an open disk. Let xp € IR" 
and letr be a positive real number. The open disk (or open ball) of radius r and center 
Xo is defined to be the set of all points x such that ||x — xo|| < r. This set is denoted 
D, (xo), and is the set of points x in IR" whose distance from xy is less than r. Notice 
that we include only those x for which strict inequality holds. The disk D,(xo) is 
illustrated in Figure 2.2.1 for n = 1, 2, 3. For the case n = 1 and xo € R, the open 
disk D, (xo) is the open interval (xo — r, xo + r), which consists of all numbers x € R 
strictly between xo — r and xo +r. For the casen = 2, xo € R?, D, (xo) is the “inside” 
of the disk of radius r centered at xo. For the case n = 3, xo € R?, D, (xo) is the part 
strictly “inside” of the ball of radius r centered at xo. 


DEFINITION: Open Sets Let U C R” (that is, let U be a subset of R”). 
We call U an open set when for every point xo in U there exists some r > 0 
such that D, (xo) is contained within U; symbolically, we write D,(xo) C U (see 
Figure 2.2.2). 
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D,(Xo) 
D,(xo) 
P cc M —»y 
n=1 8-2 x: n=3 
(a) (b) (c) 
Figure 2.2.1 What disks D, (xo) look like in (a) one, (b) two, and (c) three 
dimensions. 


The number r > 0 can depend on the point xo, and generally r will shrink as 
xo gets closer to the “edge” of U. Intuitively speaking, a set U is open when the 
"boundary" points of U do not lie in U. In Figure 2.2.2, the dashed line is not 
included in U. 

We establish the convention that the empty set Ø (the set consisting of no ele- 
ments) is open. 

We have defined an open disk and an open set. From our choice of terms it would 
seem that an open disk should also be an open set. A little thought shows that this 
fact requires some proof. The following theorem does this. 


Figure 2.2.2 An open set U is one that completely 
encloses some disk D, (xo) about each of its points xo. 
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THEOREM 1 For each xp € R” andr > 0, D, (xo) is an open set. 


PROOF  Letx € D,(xo), that is, let |x — xo|| < r. According to the definition of an 
open set, we must find an s > 0 such that D,(x) C D, (xo). Referring to Figure 2.2.3, 
we see that s =r — ||x — xo|| is a reasonable choice; note that s > 0, but that s 
becomes smaller if x is nearer the edge of D, (xo). 


Figure 2.2.3 The geometry of the proof that 


d= Ix- Xoll ew 
an open disk is an open set. 


s =r —- |IX - xoll 


To prove that D,(x) C D,(xo), let y € D,(x); that is, let ||y — x|| < s. We want 
to prove that y € D,(xo) as well. Proving this, in view of the definition of an r-disk, 
entails showing that ||y — xo|| < r. This is done by using the triangle inequality for 
vectors in IR": 


lly — xoll = ly — x) + (x — xo)ll < lly — xl] + IIx — xoll < s + lix — xoll =r. 
Hence, |l — xoll <r. m 


The following example illustrates some techniques that are useful in establishing 
the openness of sets. 


ID CWIIUEDEE Prove that A = ((x, y) € R? | x > 0} is an open set. 


SOLUTION The set is pictured in Figure 2.2.4. 

Intuitively, this set is open, because no points on the “boundary,” x = 0, are 
contained in the set. Such an argument will often suffice after one becomes accustomed 
to the concept of openness. At first, however, we should give details. To prove that 
A is open, we show that for every point (x, y) € A there exists an r > 0 such that 
D,(x, y) C A. If (x, y) € A, then x > 0. Choose r = x. If (xi, y1) € D,(x, y), we 
have 


bii =x] = Vix x VQ -xy- Qi -») «rx, 


and so xı — x <x and x — xı < x. The latter inequality implies x; > 0, that is, 
(x1, y1) € A. Hence D,(x, y) C A, and therefore A is open (see Figure 2.2.5). A 
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Figure 2.2.4 Show that A is an open set. 


Figure 2.2.5 The construction of a disk about a point in A 
that is completely enclosed in A. 


E 


It is useful to have a special name for an open set containing a given point x, 
because this idea arises often in the study of limits and continuity. Thus, by a neigh- 
borhood ofx € IR" we merely mean an open set U containing the point x. For example, 
D, (xo) is a neighborhood of x for any r > 0. The set A in Example 1 is a neighbor- 
hood of the point xo = (3, —10). 


Boundary 


Let us formally introduce the concept of a boundary point, which we alluded to in 
Example 1. 


DEFINITION: Boundary Points Let A C R". A point x € R" is called a 
boundary point of A if every neighborhood of x contains at least one point in 4 


and at least one point not in A. 


In this definition, x itself may or may not be in A; if x € A, then x is a boundary 
point if every neighborhood of x contains at least one point not in A (it already 
contains a point of 4, namely, x). Similarly, if x is not in 4, it is a boundary point if 
every neighborhood of x contains at least one point of A. 
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We shall be particularly interested in boundary points of open sets. By the defi- 
nition of an open set, no point of an open set A can be a boundary point of A4. Thus, 
a point x is a boundary point of an open set A if and only if x is not in A and every 
neighborhood of x has a nonempty intersection with A. 

This expresses in precise terms the intuitive idea that a boundary point of A is a 
point just on the “edge” of A. In many examples it is perfectly clear what the boundary 
points are. 


Dy (a) Let A = (a, b) in R. Then the boundary points of A consist 
of the points a and b. A consideration of Figure 2.2.6 and the definition will make 
this clear. [The reader will be asked to prove this in Exercise 20(c).] 


£C Boundary points S. 
x Figure 2.2.6 The boundary points of the interval (a, b). 
b 


a 


(b) Let A = D,(xo, yo) be an r-disk about (xo, yo) in the plane. The boundary 
consists of points (x, y) with (x — xo)? + (y — yo)” =r? (Figure 2.2.7). 


Boundary 


Figure 2.2.7 The boundary of A consists of points on the edge 
of A. 


DG y) - 4 


(c) Let A = ((x, y) € R? | x > 0}. Then the boundary of A consists of all points 
on the y axis (the student should draw a figure). 


(d) Let A be D, (xo) minus the point xo (a “punctured” disk about xo). Then xo is a 
boundary point of 4. A 


Limits 
We now turn our attention to the concept of a limit. Throughout the following discus- 
sions the domain of definition of the function f will be an open set A. We are interested 
in finding the limit of f as x € A approaches either a point of A or a boundary point 
of A. 
The reader should appreciate the fact that the limit concept is a basic and use- 
ful tool for the analysis of functions; it enables us to study derivatives, and hence 
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maxima and minima, asymptotes, improper integrals, and other important features of 
functions, as well as being useful for infinite series and sequences. We will present a 
theory of limits for functions of several variables that includes the theory for functions 
of one variable as a special case. 

In one-variable calculus, the student has encountered the notion of limit f(x)21 
for a function f: A C IR — R from a subset A of the real numbers to the real 
numbers. Intuitively, this means that as x gets closer and closer to xo, the val- 
ues f(x) get closer and closer to (the limiting value) /. To put this intuitive idea 
on a firm, mathematical foundation, either the “epsilon (£) and delta (3) method” 
or the "neighborhood method" is usually introduced. The same is true for func- 
tions of several variables. In what follows we develop the neighborhood approach 
to limits. The epsilon-delta approach is left for optional study at the end of this 
section. 


DEFINITION: Limit Let f: A C R” — R”, where A is an open set. Let 
Xo be in A or be a boundary point of A, and let N be a neighborhood of b € R”. 
We say f is eventually in N as x approaches Xo if there exists a neighborhood 
U of xo such that x # xo, x € U, and x € A imply f(x) € N. [The geometric 
meaning of this assertion is illustrated in Figure 2.2.8; note that xo need not be 
in the set A, so that f(xo) is not necessarily defined.] We say f(x) approaches 
b as x approaches xo, or, in symbols, 


limit f(x) = b f(xX)— b as x xo, 


X—Xo 


when, given any neighborhood N of b, f is eventually in N as x approaches xo 
[that is, “ f(x) is close to b if x is close to xo"]. It may be that as x approaches 
Xo, the values f(x) do not get close to any particular number. In this case, we say 


that limit f(x) does not exist. 
X—> Xo 


Henceforth, whenever we consider the notion limit f(x), we shall always assume 
that xo either belongs to some open set on which if is defined or is on the boundary 
of such a set. 

One reason we insist on x Æ Xo in the definition of limit will become clear if we 
remember from one-variable calculus that we want to be able to define the derivative 
f (xo) of a function f at a point xo by 


f(x) = limit 2) — fe) 


Xx— X9 x — Xo 


and this expression is not defined at x = xo. 
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Es 


x 
Figure 2.2.8 Limits in terms of neighborhoods; if x is in U, then f(x) will be in N. 
(The little open circle denotes that the point does not lie on the graph.) In the figure, 
f: A={(x, y) |x? + y? < 1} > R. (The dashed line is not in the graph of f.) 


WIJBE] (a) This example illustrates a limit that does not exist. Consider 
the function f: R — R defined by 


I ifx > 0 
f@)= b ifx <0. 


The limit f(x) does not exist, since there are points x, arbitrarily close to 0 with 
fi (x1) = 1 and also points x2 arbitrarily close to 0 with f(x?) = —1; that is, there 
is no single number that f is close to when x is close to 0 (see Figure 2.2.9). If 
f is restricted to the domain (0, 1) or (—1, 0), then the limit does exist. Can you 
say why? 

(b) This example illustrates a function whose limit does exist, but whose limiting 
value does not equal its value at the limiting point. Define f: IR — R by 


0 ifx £0 
fey = {i ifx — 0. 


Itis true that limit f(x) = 0, since for any neighborhood U of 0, x € U and x #0 


20 
implies that f(x) C 0. One sees from the grapb in Figure 2.2.10 that f approaches 0 
as x — 0; we do not care that f happens to take on some other value at 0. A 
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Figure 2.2.9 The limit of this function as 
x — 0 does not exist. 


(0, 1) Figure 2.2.10 The limit of this function as x — 0 is zero. 


DG12 BÆ] Use the definition to verify that the “obvious” limit x = xo holds, 
XX 
where x and x, € IR". ° 


SOLUTION Let f be the function defined by f(x) = x, and let N be any neigh- 

borhood of xy. We must show that f(x) is eventually in N as x — xo. According to the 

definition, we must find a neighborhood U of xo with the property that if x 4 xo and 

x € U, then f(x) e N. Pick U = N. If x € U, then x € N; because x = f(x), it fol- 

lows that f(x) € N. Thus, we have shown that limit x = Xo. Ina similar way, we have 
— Xo 


limit x = xo, etc. A 
(x,y) (0.0) 


In what follows, the student may assume, without proof, the validity of limits from 
one-variable calculus. For example, limit yx = JVI = 1 and limit sin 0 = sin 0 = 0 
x > 
may be used. 


1 (This example demonstrates another case in which the limit cannot 
simply be "read off" from the function.) Find limit g(x) where 
x 


x-1 


vx-1 


gxe 
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SOLUTION This function is graphed in Figure 2.2.1 (a). 


g*x—d4x«l 


(a) (b) 


Figure 2.2.11 These graphs are the same except that in part (a), g is undefined at 
x = 1, whereas in part (b), g* is defined for all x > 0. 


We see that g(1) is not defined, because division by zero is not defined. However, 
if we multiply the numerator and denominator of g(x) by ./x + 1, we find that for all 
x in the domain of g we have 


x-—1 
sero PP xe. 


The expression g*(x) = ./x + 1 is defined and takes the value 2 at x = 1; from 
one-variable calculus, g*(x) — 2 as x — 1. But because g*(x) = g(x) for all x > 0, 
x Æ 1, we must have as well that g(x) > 2asx > 1. A 


We will consider other examples in two variables shortly. 


Properties of Limits 


To properly speak of the limit, we should establish that f can have at most one limit 
as x — Xo. This is intuitively clear and we now state it formally. (See the Internet 
supplement for the proof.) 


THEOREM 2: Uniqueness of Limits 
If — limitf(xX) - b; — and  limitf(x) bj, then b= b; 
xX Xo xX>X0 


To carry out practical computations with limits, we require some rules for limits, 
for example, that the limit of a sum is the sum of the limits. These rules are summarized 
in the following theorem (see the Internet supplement for Chapter 2 for the proof). 
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THEOREM 3: Properties of Limits Let f: A CR" > R”, g: AC 
R” — R”, xo be in A or be a boundary point of A, b € R”, and c € R; then 
(i) If limit f(x) = b, then limit cf (x) = cb, where cf: A — IR" is defined by 

x ex cf). | 

(ii) If limit f(x) = b; and Inge b2, then limit (f + g)(x) = bi + bo, 
where (f +g): A > R” is defined by x œ> fG)- $ g(x). 

(iii) If m — 1, limit f(x) = bi, and limit g(x) = b», then limit (fg)(x) = bib», 
where ( f; 2): EA — R is defined by x > f(x)g(x). 


(iv) If m=1, limit f(x) 2 5 z:0, and f(x)#0 for all xe A, then 
X—Xo 
limit 1/f(x) = 1/b, where 1/f: A — R is defined by x + 1/f(x). 
XX) 
(v) If f(x) = (A(x), ..., fo (x)) where fi: A > R,i = l,..., m, arethecom- 
ponent functions of f, then limit f(x) = b = (bi,..., bm) if and only if 
XX) 
limit f;(x) = b; foreach i = 1,...,m. 
XX 


These results ought to be intuitively clear. For instance, rule (ii) says that if 
f(x) is close to b, and g(x) is close to b; when x is close to xo, then f(x) + g(x) 


is close to b; + b; when x is close to xo. The following example illustrates how this 
works. 


IS VIEW] Let f: R? > R, (x, y) > x? + y? +2. Compute the limit 


limit y) 
m m 
SOLUTION Here f is the sum of the three functions (x, y) œ> x?, (x, y) > y?, 
and (x, y) +» 2. The limit of a sum is the sum of the limits, and the limit of a product 
is the product of the limits (Theorem 3). Hence, using the fact that limit OX m XQ 
(Example 4), we obtain SIPRI 


limit x?-[ limitx Y limit x )es 
Gy) (0.0) y) oyo) NGcy)9 0,30), 


and, using the same reasoning, limit y? = x. Consequently, 
Gy) (0.0) 


limit f(x, y) 20 +1?+2=3. A 
oim mit f(x, y) 
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Continuous Functions 


In single-variable calculus we learned that the idea of a continuous function is based 
on the intuitive notion of a function whose graph is an unbroken curve, that is, a curve 
that has no jumps, or the kind of curve that would be traced by a particle in motion or 
by a moving pencil point that is not lifted from the paper. 

To perform a detailed analysis of functions, we need concepts more precise than 
this rather vague notion. An example may clarify these ideas. Consider the specific 
function f: IR — R defined by f(x) = —lifx < Oand f(x) = lifx > 0. The graph 
of f is shown in Figure 2.2.12(a). [The little open circle denotes the fact that the point 
(0, 1) does rot lie on the graph of /]. Clearly, the graph of f is broken at x — 0. 
Consider also the function g: x +> x?. This function is pictured in Figure 2.2.12(b). 
The graph of g is not broken at any point. 


g: Xr x? 


»- x 


(a) (b) 


Figure 2.2.12 The function f in part (a) is not continuous, because its value jumps 
as x crosses 0, whereas the function g in part (b) is continuous. 


If one examines examples of functions like f, whose graphs are broken at some 
point xo, and functions like g, whose graphs are not broken, one sees that the principal 
difference between them is that for a function like g, the values of g(x) get closer to 
(xo) as x gets closer and closer to x9. The same idea works for functions of several 
variables. But the notion of closer and closer does not suffice as a mathematical 
definition; thus, we shall formulate these concepts precisely in terms of limits. 

Because the condition limit f(x) = f(xo) means that f(x) is close to f(xo) when 

i) 


x is close to xo, we see that this limit condition does indeed correspond to the require- 
ment that the graph of f be unbroken (see Figure 2.2.13, where we illustrate the 
case f: R — RR). The case of several variables is easiest to visualize if we deal with 
real-valued functions, say f: R? — R. In this case, we can visualize f by drawing 
its graph, which consists of all points (x, y, z) in R? withz — f(x, y). The continuity 
of f thus means that its graph has no “breaks” in it (see Figure 2.2.14). 
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Figure 2.2.13 (a) Discontinuous function for which limit,_,,, f(x) does not exist. 
(b) Continuous function for which this limit exists and equals f (xo). 


N 


z 
Break in the 


surface z — f(x, y) 


RN ` 
rms - ^ 


y i M 
! 
| 
: 


x x 


Set of discontinuities of f; 
i.e., the set of points 
where f is discontinuous 


(a) (b) 
Figure 2.2.14 (a) A discontinuous function of two variables. (b) A continuous 
function. 


DEFINITION: Continuity Let f: A C IR" > R" bea given function with 
domain A. Let x; € A. We say f is continuous at xo if and only if 


limit f(x) = f(xo). 
XX 


If we just say that f is continuous, we shall mean that f is continuous at each 
point xo of A. If f is not continuous at xo, we say f is discontinuous at xo. If f 
is discontinuous at some point in its domain, we say f is discontinuous. 
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D> Gwe Bee) Any polynomial p(x) = ao +aıx 4--::--a,x" is continuous 
from R to R. Indeed, from Theorem 3 and Example 4, 


limit (ao + aix + +++ + a,x") = limit ao + limit aix +--+ + limit a,x” 
XX Xx—Xo x—Xo x—3xo 
= ao + 41X0 + ++ d asxXQ, 


because the limit of a product is the product of the limits, which gives 


n 
limit x" = (limit x)" xf. a 


Xx—Xxo x—Xo 


DBA Let f: R? > R, f(x, y) = xy. Then f is continuous, because, by 
the limit theorems and Example 4, 


limit xy= limit x limit = Xoyo. A 
@.y)>(x0:y0) i N-" ae) n 


One can see by the same method that any polynomial p(x, y) [for example, 
P(x, y) = 3x? — 6xy? + y?] in x and y is continuous. 


DCW] The function f: R? — R defined by 


1 ifx <Oory <0 
fe Y= b otherwise 
is not continuous at (0, 0) or at any point on the positive x axis or positive y axis. 
Indeed, if (xo, yo) = u is such a point (i.e., xo = 0 and yo > 0, or yo = 0 and xo > 0) 
and à > 0, there are points (x, y) € Ds(u), a neighborhood of u, with f(x, y) = 1 
and other points (x, y) € D;(u) with f(x, y) = 0. Thus, it is not true that f(x, y) > 
fo, yo) = 1 as (x, y) > (xo, yo). A 


To prove that specific functions are continuous, we can avail ourselves of the 
limit theorems (see Theorem 3 and Example 7). If we transcribe those results in terms 
of continuity, we are led to the following: 


THEOREM 4: Properties of Continuous Functions Suppose that 
f: ACR" > R",g: AC R" — R", and let c be a real number. 


(i) If f is continuous at Xo, so is cf, where (cf X(x) = c[ f (x)]. 


(ii) If f and g are continuous at xo, so is f + g, where the sum of f and g is 
defined by (f + g)(x) = f(x) + g(x). 
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(iii) If f and g are continuous at xp and m = 1, then the product function fg 
defined by (fg)(x) = f(x)g(x) is continuous at xo. 


(iv) If f: A C R” > R is continuous at xy and nowhere zero on A, then the 
quotient 1/f is continuous at xo, where (1/ f)(x) = 1//(x). 


(v) If f: A C R” > R” and f(x) = (fi(x), ..., fo (X)), then f is continuous 
at Xo ifand only if each of the real-valued functions fi, ..., fm is continuous 
at Xo. 


A variant of (iv) is often used: If /(xo) Æ 0 and f is continuous, then f(x) # 0 
in a neighborhood of xo and so 1/f is defined in that neighborhood, and 1/f is 
continuous at Xo. 


ID CWIEUEU] Let f: R? > R2, (x, y) (x2y, (y 4-x2)/(1 4- x2)). Show that 


f is continuous. 


SOLUTION To see this, it is sufficient, by property (v) of Theorem 4, to show 
that each component is continuous. As we have mentioned, any polynomial in two 
variables is continuous; thus, the map (x, y) +> x?y is continuous. Because 1 + x? 
is continuous and nonzero, by property (iv), we know that 1/(1 4- x?) is continu- 
ous; hence, (y + x?)/(1 + x?) is a product of continuous functions, and by (iii) is 
continuous. A 


Similar reasoning applies to examples like the function e: IR > R? given by 
e(t) = (t2, 1, 9 /(1 + £?) to show they are continuous as well. 


Composition 


Next we discuss composition, another basic operation that can be performed on func- 
tions. If g maps A to B and f maps B to C, the composition of g with f, or of f on 
g, denoted by f o g, maps A to C by sending x +> /(g(x)) (see Figure 2.2.15). For 
example, sin (x?) is the composition of x i x? with y — sin y. 


g A. 
P amm M 
Figure 2.2.15 The composition of f on g. 
fog 
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THEOREM 5: Continuity of Compositions Let g: 4 C R" — R" 
and let f: B C R” — R”. Suppose g(A) C B, so that f o g is defined on A. 
If g is continuous at x9 € A and f is continuous at yo = g(xo), then f og is 
continuous at Xo. 


The intuition behind this is easy; the formal proof in the Internet supplement 
follows a similar pattern. Intuitively, we must show that as x gets close to xo, f(g(x)) 
gets close to f (g(xo)). But as x gets close to xo, g(x) gets close to g(xo) (by continuity 
of g at xo); and as g(x) gets close to g(xo), /(g(x)) gets close to f'(g(xo)) [by continuity 
of f at g(xo)]. 


Let f(x,y,z) = (x? + y? E z2)? + sin z’. Show that f is 
continuous. 

SOLUTION Here we can write f as a sum of the two functions (x? + y? + z2)? 
and sin z?, so it suffices to show that each is continuous. The first is the composite 
of (x,y,z) (x? + y? +z?) with u e u?9, and the second is the composite of 
(x, y, z) > z? with u + sin u, and so we have continuity by Theorem 5. A 


Limits in Terms of £'sand ó's 


We now state a theorem (proved in the Internet supplement for Chapter 2) giving 
a useful formulation of the notion of limit in terms of epsilons and deltas that is 
often taken as the definition of limit. This is, in fact, another way of making precise 
the intuitive statement that “ f (x) is close to b when x is close to xo." To help understand 
this formulation, the reader should consider it with respect to each of the examples 
already presented. 


THEOREM 6 Let f: A C R” > R” and let xo be in A or be a boundary point 
of A. Then limit f(x) = b if and only if for every number € > 0 there is a ô > 0 
such that for any x € A satisfying 0 < ||x — xo|| < ô, we have || f(x) — b|| < € (see 
Figure 2.2.16). 


To illustrate the methodology of the epsilon-delta technique in Theorem 6, we 
consider the following examples. 


DEAE Showthat limit x = 0 using the e- method. 
x.) (0.0) 


SOLUTION Note that if 5 > 0,|(, y) — (0,0)]| = yx? +y? <ô implies 
|x — 0| = |x| = Vx? < /x? + y? < ô. Thus, if ||(x, y) — (0, 0)|| < ô, then |x — 0| 
is also less than ô. Given € > 0, we are required to find a ô > 0 (generally depending 
on €) with the property that 0 < ||(x, y) — (0, 0)|| < à implies |x — 0| < £. What are 
we to pick as our à? From the preceding calculation, we see that if we choose 5 = e, 
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Image of Ds (X 9) 


Figure 2.2.16 The geometry of the &-ó definition of limit. 


then ||(x, y) — (0, 0)|| < 6 implies |x — 0| < £. This shows tan limit ae =0. Given 
x,¥)>(0, 


€ > 0, we could have also chosen ô = &/2 or &/3, but it suffices to find just one à 
satisfying the requirements of the definition of a limit. A 


L2G eS) Consider the function 
sin (x? + y?) 
xs =. 

f(x,y) gay 


Even though f is not defined at (0, 0), determine whether f(x, y) approaches some 
number as (x, y) approaches (0, 0). 


SOLUTION From one-variable calculus or L'Hópital's rule we know that 


. ., Sino 
limit = 
a0 Q 
Thus, it is reasonable to guess that 
n 2 
sin ||v 
limit f(v) = I H = 
v>(0,0) be Iivil 


Indeed, because limit (sin a)/a = 1, given € > 0 we are able to find a à > 0, with 


0 <ô < 1, such that 0 < |a| < à implies that |(sin w)/a — 1| < £. If 0 < ||v|| < 4, 
then 0 < ‘Wie < 6° < 6, and therefore 


sin |lv|? | 


Ivi? 


If) — = 


Thus, m f(v) = 1. If we plot [sin (x? + y?)]/(? + y?) on a computer, we get a 
graph that is indeed well behaved near (0, 0) (Figure 2.2.17). A 
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Figure 2.2.17 Graph of 
the function f(x, y) — 
[sin x? + y?)/G? +y’). 


ID CNIIBDBOEES Show that 


3 
x 
limit | ———— = 0. 
E> /x2 F y? 


SOLUTION We must show that x?/ /x? + y? is small when (x, y) is close to the 
origin. To do this, we use the following inequality: 


— —— 43 
MEET Jerr 
= JP oe. 


Given € > 0, choose 6 = e. Then ||(x, y) — (0, 0)]| = I(x, ») Il = yx? + y?, and so 
lx, y) — (0, 0)|| < ô implies that 


(because y? > 0) 


< yx? + y? = II, y) — (0, Ol «8 =e. 


x? x? 


Thus, the conditions of Theorem 6 have been fulfilled and the limitis verified. a 


DGE (a) Does 


limit x?/(? + y? 
e g fe y) 
exist? [See Figure 2.2.18(a).] 

(b) Prove that [see Figure 2.2.18(b)] 


: 2x? 
limit = 2d 5 — 0. 
(x.y) > (0,0) x? + y 
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As (x, y) approaches 7 
(0, 0) along this 
ridge, z— 1 


Y As (x, y) approaches 
(0, 0) in this 
(a) valley, z > 0 (b) 
Figure 2.2.18 (a) The function z = x?/(x? + y?) has no limit at (0, 0). (b) The 
function z = (2x?y)/(x? + y?) has limit 0 at (0, 0). 


SOLUTION (a) If the limit exists, x?/(x? + y?) should approach a definite 
value, say a, as (x, y) gets near (0, 0). In particular, if (x, y) approaches zero along 
any given path, then x?/(x? + y?) should approach the limiting value a. If (x, y) 
approaches (0, 0) along the line y = 0, the limiting value is clearly 1 (just set y = 0 
in the preceding expression to get x?/x? = 1). If (x, y) approaches (0, 0) along the 
line x = 0, the limiting value is 


lm ———; =041. 
y20 y? P 

Hence, limit x? /(x? + y?) does not exist. 

Gy) (0,0) 
(b) Note that 
2x?y 2x?y 
7 2| I| | = 2): 
x4» x 


Thus, given € > 0, choose à = &/2; then 0 < ||(x, y) — (0,0)] = yx? + y? < 8 
implies |y| < 5, and thus 


2x?y 


M e) 
x? + y? 


<2=¢6 A 


Using the £-ô notation, we are led to the following reformulation of the definition 
of continuity. 


THEOREM 7 Let f: A C R” — R” be given. Then f is continuous at xy € A 
if and only if for every number € > 0 there is a number ô > 0 such that 


xeA and lx — xoll < à implies If (x) — f(xo)ll < £- 
The proof is almost immediate. Notice that in Theorem 6 we insisted that 


0 < |x — xoll, that is, x zz xo. That is not imposed here; indeed, the conclusion of 
Theorem 7 is certainly valid when x — xo, and so there is no need to exclude this 
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case. Here we do care about the value of f at xo; we want f at nearby points to be 
close to this value. 


EXERCISES 


In the following exercises the reader may assume that the exponential, sine, and cosine 
functions are continuous and may freely use techniques from one-variable calculus, such as 
L'Hópital S rule. 


Show that the subsets of the plane in Exercises 1—4 are open: 


1. 


A={(x,y)|-l<x<1,-l<y<I]} 


B = {(x, y) | y > 0} 
C = {(x, y) |2 < x? +y? <4} 
D = {(x, y) | x # 0 and y # 0} 
Compute the limits: 


a) limit x? 
Oa 7 


. 4,C08x —1 
(b) limit ————— 
x20 y 


. Compute the following limits: 


a) limit 
@ @&y)> (0.1) 
sin? x 


ey 


(b) limit 

x0 x 
Compute the following limits: 
(a) limit(x? —3x +5) 


(b) limit sinx 


Compute the following limits if they exist: 


(x+y)? —(@— yf? 


(a) limit 
(x.y) (0.0) xy 
a, BITE 
(b) limit x 
(3)(00 y 


. Compute the following limits if they exist: 


a= 


limit 
S > 0.0) y 
() limi SGT 


&«3)—(00 — x? y? 


aA. 
Nd ru 
Qo Tay -x? 
6 
© EE 


limit | ———— 
(x.y) > (0.0) x2 + y? 


limit ay 


c imit ——— 
©) (xy) > 0,0) x? + y? +2 
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10. Compute the following limits, if they exist: 


e — yy 
(a) limit () limit & =»? 
(00 x 4-1 63) x? 3 y? 
— 1 — (2/2 
(x: dac Sew) 
(3) 0.0) x4 + y4 


11. Compute the following limits if they exist: 


(2) limi “7% 
(3) (0,0) xy 
sin (xyz) 
(b ae 


imi| 
Q2) (000)  xyz 


(c) limit — f(x, y,z), where f(x, y, z) = (x? + 3y? )/(x + 1). 
(x,y,z) — (0,0,0) 


12. Compute the following limits if they exist: 
sin2x — 2. 2x? 
(a) limit SEU Md (c) limi = 2S? 
x3 (x,y,z) (0.0.0) x2 + y? 


in 2x — 2x 
(by time Meme ty 
(3) > (0.0) gy 


13. Compute limit f(x), if it exists, for the following cases: 
xx 
(a) f: R> R,x e |x], xo = 1 


(b) f: R” > R, x} |xll, arbitrary xo 
(c) f: R> Rx e (x, e*), x =1 


(d) f: RNC, 0)} > R?, (x, y) H (sin(x — y), e'6*? — x — D/II@, y)ll, xo = (0, 0). 


14. Let A C R? be the open unit disk D, (0, 0) with the point xo = (1, 0) added, and let 
f: A— R,x f(x) be the constant function f(x) = 1. Show that limit f(x) = 1. 
X— X0 


15. If f: IR" — IR and g: IR" — R are continuous, show that the functions 


Pg: RY > Rx e Ule 
and 
PgR Rx e» [SOPH 
are continuous. 


16. (a) Show that f: R > R, x + (1 — x)? + cos(1 + x?) is continuous. 
(b) Show that the map f: IR — R, x + x?e*/(2 — sin x) is continuous. 


17. (a) Can [sin (x + y)]/(x + y) be made continuous by suitably defining it at (0, 0)? 
(b) Can xy/(x? + y?) be made continuous by suitably defining it at (0, 0)? 
(c) Prove that f: R? — R, (x, y) e» ye + sinx + (xy)* is continuous. 


18. Using either e’s and ó's or spherical coordinates, show that 


Xyz 


limit = = 
G2) (0,0,0) x? + y? + z? 
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19. Use the £-ô formulation of limits to prove that x? — 4 as x — 2. Give another proof 
using Theorem 3. 


20. (a) Prove that for x € R” and s < t, D,(x) C D,(x). 
(b) Prove that if U and V are neighborhoods of x € IR", then so are U N V and UU V. 
(c) Prove that the boundary points of an open interval (a, b) C R are the points a and b. 


21. Suppose x and y are in R” and x Æ y. Show that there is a continuous function 
f: R^ > R with f(x) = 1, f(y) = 0, and 0 < f(z) < | for every z in R”. 


22. Let f: A C IR" — R be given and let x; be a boundary point of A. We say that 
limit f(x) = oo if for every N > 0 there isa ô > 0 such that 0 < ||x — xo|| < ô and x € A 
X— X 
implies f(x) > N. 

(a) Prove that limit (x — 1)? = oo. 

(b) Prove that limit 1/|x| = oo. Is it true that limit 1/x = oo? 

Prove that limit 1/(x° + y?) = oo. 
(c) Prove tha: M I(x" + y)2oo 


23. Let b € Rand f: R\[b] — R be a function. We write limit f(x) = L and say that L 
xb 


is the left-hand limit of f at b, if for every € > 0, there is a 5 > 0 such that x < b and 
0 < |x — b| < ô implies | f(x) — L| < £. 


(a) Formulate a definition of right-hand limit, or limit JG). 
xb 
(b) Find limit 1/(1 + e'/*) and limit 1/(1 + el/*), 
(c) Sketch the graph of 1/(1 + e!/*). 
24. Show that f is continuous at xo if and only if 
limit || f(x) — f(xo)l| = 0. 
XX 
25. Let f: A C R" > R” satisfy || f(x) — f(y)ll € K |x — y|l* for all x and y in A for 
positive constants K and a. Show that f is continuous. (Such functions are called 


Hólder-continuous or, if a = 1, Lipschitz-continuous.) 


26. Show that f: IR" — IR" is continuous at all points if and only if the inverse image of 
every open set is open. 


27. (a) Find a specific number 5 > 0 such that if |a| < 5, then |a? + 3a? + a| < 1/100. 
(b) Find a specific number 5 > 0 such that if x? + y? < 6, then 


|x? + y? + 3xy + 180xy5] < 1/10,000. 


2.3 Differentiation 


In Section 2.1 we considered a few methods for graphing functions. By these methods 
alone it may be impossible to compute enough information to grasp even the general 
features of a complicated function. From elementary calculus we know that the idea 
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of the derivative can greatly aid us in this task; for example, it enables us to locate 
maxima and minima and to compute rates of change. The derivative also has many 
applications beyond this, as the student surely has discovered in elementary calculus. 
Intuitively, we know from our work in Section 2.2 that a continuous function is 
one that has no “breaks” in its graph. A differentiable function from R? to R ought to 
be such that not only are there no breaks in its graph, but there is a well-defined plane 
tangent to the graph at each point. Thus, there must not be any sharp folds, corners, 
or peaks in the graph (see Figure 2.3.1). In other words, the graph must be smooth. 
corner 


v 
ee 2 n 


(3) li (b) 
Figure 2.3.1 (a) A smooth graph and (b) a nonsmooth one. 


z 


z=fx,y) penk 


Partial Derivatives 


To make these ideas precise, we need a sound definition of what we mean by the 
phrase “ f (x1, . .., Xn) is differentiable at x = (x), . . . , Xn)?” Actually, this definition 
is not quite as simple as one might think. Toward this end, however, let us introduce 
the notion of the partial derivative. This notion relies only on our knowledge of one- 
variable calculus. (A quick review of the definition of the derivative in a one-variable 
calculus text might be advisable at this point.) 


DEFINITION: Partial Derivatives Let U C R” bean open set and suppose 
f: U CR" — Risa real-valued function. Then 0//0x),..., 0f/0x,, the par- 
tial derivatives of f with respect to the first, second, ... , nth variable, are the 
real-valued functions of n variables, which, at the point (x), ..., Xn) = x, are 
defined by 


af SRK 2,000 xj Aes Xn) — Ses os Mn) 
da; ie n h 
zie f(x t he;) — f(x) 
h0 h 
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ifthe limits exist, where 1 < j < n and e; is the jth standard basis vector defined 
by e; =(0,..., 1, ..., 0), with 1 in the jth slot (see Section 1.5). The domain 
of the function 3f/ðx; is the set of x € IR" for which the limit exists. 


In other words, 0f/0x; is just the derivative of f with respect to the variable 
xj, with the other variables held fixed. If f: R? — R, we shall often use the notation 
af/ax, 9f/8y, Af/dz inplaceof8f/8xi, Af/Ax2, 8f/80xs.1f f: U C R” > R",then 
we can write 


fas -ees Xn) m (AiE ssi X) fnis se Xn)), 


so that we can speak of the partial derivatives of each component; for example, 
0f, /0x, is the partial derivative of the mth component with respect to xn, the nth 
variable. 


DONWON If f(x,y) — x?y +’, find af/dx and 8//8y. 


SOLUTION To find 8//8x we hold y constant (think of it as some number, say 1) 
and differentiate only with respect to x; this yields 


af deyty) _ 4. 
ax dx ae 


Similarly, to find 3f/ðy we hold x constant and differentiate only with respect to y: 


af _ d@yty’) 


2 2 
= 3y. A 
ay dy x" + Sy’ 


To indicate that a partial derivative is to be evaluated at a particular point, for 
example, at (xo, yo), we write 


af à x 
ax Ox 


Xx=X0,V=Yo (x0,¥0) 


a 
3f xo, yo or 
Ox 


When we write z = f(x, y) for the dependent variable, we sometimes write 0z/àx 
for 9f/àx. Strictly speaking, this is an abuse of notation, but it is common practice 
to use these two notations interchangeably. 


[D CWIIUESPA Ifz = cosxy +x cos y = f(x, y), find the two partial derivatives 
(8z/àx) (xo, yo) and (8z/8y)(xo, yo). 
SOLUTION First we fix yo and differentiate with respect to x, giving 


d(cos xyo + x cos yo) 


m ) 
m x. = 
ax 0, Yo. FE 


— 
= (—yo sin xyo + cos yo)lx=xo 


= —yo Sin Xoyo + COS yo. 
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Similarly, we fix xo and differentiate with respect to y to obtain 


d(cos xoy + xo cos y) 
dy y=» 


ðz 
ay yo) — 


= (—xo sin xoy — xo sin y)|y=y 


= —Xxosinxoyo — Xo SİN yọ. A 


DaB Find 0//0x if f(x, y) 2 xy/ y x? + y?. 
SOLUTION By the quotient rule, 


af yyx-yM-xye/yxX.yh ye x.y-xy — y 
ax x? +y? (x? + yy? (x? + y?)3/2 1 


A definition of differentiability that requires only the existence of partial deriva- 
tives turns out to be insufficient. Many standard results, such as the chain rule for 
functions of several variables would not follow, as Example 4 shows. Below, we shall 
see how to rectify this situation. 


SENEE] Let f(x, y) 2 x!/5y!^. By definition, 


af imit fih, 0) 


f0,0) 0—0 
üx (9.9) "n 


= limit 0, 
h h>0 


and, similarly, (3 f/3y)(0, 0) = 0 (these are not indeterminate forms!). It is necessary 
to use the original definition of partial derivatives, because the functions x!⁄ and y!/? 
are not themselves differentiable at 0. Suppose we restrict f to the line y =x to get 
f(x, x) = x?’ (see Figure 2.3.2). We can view the substitution y = x as the compo- 
sition f o g of the function g: IR — IR?, defined by g(x) = (x, x), and f: R? > R, 
defined by f(x, y)  x!/3y!5, 

Thus, the composite f o g is given by (f o g)(x) = x?/3. Each component of g is 
differentiable in x, and f has partial derivatives at (0, 0), but f o g is not differentiable 
at x = 0, in the sense of one-variable calculus. In other words, the composition of f 
with g is not differentiable in contrast to the calculus of functions of one variable, where 
the composition of differentiable functions is differentiable. Below, we shall give a 
definition of differentiability that has the pleasant consequence that the composition 
of differentiable functions is differentiable. 

There is another reason for being dissatisfied with the mere existence of partial 
derivatives of f(x, y) = x! y!/5: There is no plane tangent, in any reasonable sense, 
to the graph at (0, 0). The xy plane is tangent to the graph along the x and y axes 
because f has slope zero at (0, 0) along these axes; that is, 3f/3x = 0 and 8f/8y = 
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Figure 2.3.2 The portion of the graph of 
x15y!/5 in the first quadrant. 


at (0, 0). Thus, if there is a tangent plane, it must be the xy plane. However, as is 
evident from Figure 2.3.2, the x y plane is not tangent to the graph in other directions, 
because the graph has a severe crinkle, and so the xy plane cannot be said to be 
tangent to the graph of f. 4 


The Linear Approximation 


To "motivate" our definition of differentiability, let us compute what the equation of 
the plane tangent to the graph of f: IR? > R, (x, y) > f(x, y) at (xo, yo) ought to 
be if f is smooth enough. In R?, a nonvertical plane has an equation of the form 


z — ax t by +c. 


If it is to be the plane tangent to the graph of f, the slopes along the x and y axes 
must be equal to 3f/ðx and /f/dy, the rates of change of f with respect to x and y. 
Thus, a = 9f/üx, b = 8f/8y [evaluated at (xo, yo)]. Finally, we may determine the 
constant c from the fact that z = (xo, yo) when x = xo, y = yo. Thus, we get the 
linear approximation: 


z= f(Xo, yo) + [eov] Xo) + [Feo] yo), (1) 
x ay 


which should be the equation of the plane tangent to the graph of f at (xo, yo), if f 
is “smooth enough” (see Figure 2.3.3). 

Our definition of differentiability will mean in effect that the plane defined by 
the linear approximation (1) is a “good” approximation of f near (xo, yo). To get an 
idea of what one might mean by a good approximation, let us return for a moment to 


132 Differentiation 


Tangent plane 
of graph fat 
(xo, yo, f xo, J9)) 


Figure 2.3.3 For points (x, y) near (xo, yo), the 
oyo fixo, 99) graph of the tangent plane is close to the graph of f. 


(xo, yo) 


one-variable calculus. If f is differentiable at a point xo, then we know that 


(xo + Ax) — fo) 


Hit Ax = f (o). 
Let x = xo + Ax and rewrite this as 
limit f) - fGo) = f'(x). 
X—Xo X-—2xg 


Using the trivial limit limit f'(xo) = f’(xo), we can rewrite the preceding equation as 
x30 


limit LO) = fo) = limit f'(xo); 
X—Xo X — X9 X— X0 

that is, 
Bes | fe)- fe) fev] =ü: 
X—> X0 X — Xi 

that is, 


limit L LC) = fex = x0) _ 9 


x—xo x — Xo 


Thus, the tangent line / through (xo, f (xo)) with slope f"(xo) is close to f in the sense 
that the difference between f(x) and I(x) = f(xo) + f" (xo)(x — xo), the equation of 
the tangent line goes to zero even when divided by x — xo as x goes to xo. This is the 
notion of a “good approximation" that we will adapt to functions of several variables, 
with the tangent line replaced by the tangent plane [see equation (1), given earlier]. 
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Differentiability for Functions of Two Variables 


Using the linear approximation, we are ready to define the notion of differentiability. 


DEFINITION: Differentiable: Two Variables Let f: R? > R. We say f 
is differentiable at (xo, yo), if 0f/dx and 0f/dy exist at (xo, yo) and if 


a ð 
F(x, y) — f(xo, yo) — E »]e =)= E »]o — yo) 
lix. y) — Go. yo)ll 


—0 


(2) 


as (x, y) — (xo, yo). This equation expresses what we mean by saying that 


fean) + E coso |e x0) + E Gov o: » 
x oy 


is a good approximation to the function f. 


It is not always easy to use this definition to see whether f is differentiable, but 
it will be easy to use another criterion, given shortly in Theorem 9. 


Tangent Plane 


We have used the informal notion of the plane tangent to the graph of a function 
to motivate our definition of differentiability. Now we are ready to adopt a formal 
definition of the tangent plane. 


DEFINITION: Tangent Plane Let f: R? > R be differentiable at xo = 
(xo, yo). The plane in R? defined by the equation 


a a 
z= f(Xo, yo) + E w]e xo) + [feo] Yo), 


is called the tangent plane of the graph of f at the point (xo, yo). 


6.078829 Compute the plane tangent to the graph of z = x? + y+ + e at 
the point (1, 0, 2). 


SOLUTION Use formula (1), with xo = 1, yo = 0, and zo = f(xo, yo) =2. The 
partial derivatives are 


a a 
& nx + ye? and Za 4y? + xe?. 
Ox ay 
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At (1, 0, 2), these partial derivatives are 2 and 1, respectively. Thus, by formula (1), 
the tangent plane is 


z=2(x —1)+ l(y -0) +2, that is, z=2x+y. A 
Let us write Df (xo, yo) for the row matrix 


a 9 
[Zo yo) L en. vo} 


so that the definition of differentiability asserts that 


ei 


xo yo) + Df'(xo. »[; = 


= fes | Zoom] -2)+[Zoo0mfo—m — o 
x ðy 


is our good approximation to f near (xo, yo). As earlier, “good” is taken in 
the sense that expression (3) differs from f(x,y) by something small times 


V(x — xo)? + (y — yo). We say that expression (3) is the best linear approxima- 


tion to f near (xo, yo). 


Differentiability: The General Case 


Now we are ready to give a definition of differentiability for maps f of R” to R”, using 
the preceding discussion as motivation. The derivative Df (xo) of f = (fi,..., fm) 
at a point xo is a matrix T whose elements are t;; = 3f; /0x; evaluated at xo.? 


DEFINITION: Differentiable, n Variables, m Functions Let U bean open 
set in R" and let f: U C IR" — R" be a given function. We say that f is differ- 
entiable at xo € U if the partial derivatives of f exist at xo and if 


Iœ- f») = Tx = xoll _ 


limit 0, (4) 
9% IIx — xoll 

where T = D/(xo) is the m x n matrix with matrix elements 8f; /8x; evaluated 

at Xy and T(x — xo) means the product of T with x — xo (regarded as a column 

matrix). We call T the derivative of f at xo. 


?]t turns out that we need to postulate the existence of only some matrix giving the best linear approximation near xy € R^, 
because in fact this matrix is necessarily the matrix whose i jth entry is à f; /àx ; (see the Internet supplement for Chapter 2). 
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We shall always denote the derivative T of f at xo by D f (xo), although in some 
books it is denoted d f (xo) and referred to as the differential of f. In the case where 
m = 1, the matrix T is just the row matrix 


KJ sas ace]. 
xi 


Ox, 


(Sometimes, when there is danger of confusion, we separate the entries by commas.) 
Furthermore, setting n = 2 and putting the result back into equation (4), we see that 
conditions (2) and (4) do agree. Thus, if we let h = x — xo, a real-valued function f 
of n variables is differentiable at a point xo if 


n 


ð 
|r +h)- f@)-) 7 4 -(xo)hj 
J 


Jal 


1 
lihi] 


=0, 


limit 
h0 
because 
a ð 
Th= ch rL. 
j=l xj 


For the general case of f mapping a subset of R” to IR", the derivative is the 
m x n matrix given by 


af a 
Ox) OXn 
Df(x)-]| : & s 
fm, Om 
Ox) OX, 


where @f;/0x; is evaluated at xo. The matrix Df(xo) is, appropriately, called the 
matrix of partial derivatives of f at xo. 


Calculate the matrices of partial derivatives for these functions. 
(2) f(x. y) 9 (€? + y, yx) 

(b) f(x, y) = G? + cosy, ye") 

() f(y. 2) = Ze", —ye*) 

SOLUTION 


(a) Here f: R2 > R? is defined by fi(x, y) =e +y and fo(x, y) = yx. 
Hence, Df (x, y) is the 2 x 2 matrix 

[e eva 

»e.»-[ y 2xy | 
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(b) We have 


Df(, y) = ke bui 


(c) In this case, 


wot til. 


Gradients 


For real-valued functions we use special terminology for the derivative. 


DEFINITION: Gradient Consider the special case f: U C R” — R. 
Here Df(x) isa 1 x n matrix: 


E an x 


Df(x) = | 


ax, OX, 


We can form the corresponding vector (8f/8x;, . .. , 0f/0x,), called the gradient 
of f and denoted by Vf or grad f. 


From the definition, we see that for f: R? > R, 
while for f: R? > R, 


The geometric significance of the gradient will be discussed in Section 2.6. In 
terms of inner products, we can write the derivative of f as 


Df(x)(h) = V(x) -h. 


ene Let /: R? > R, f(x, y, z) 2 xe". Then 


(9f Oh OP) 
VETE AE A 
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If f: R? > Ris given by (x, y) œ> e? + sinxy, then 


Vf(x, y) = (ye? + y cos xy)i + (xe? + x cos xy)j 
= (e? +cosxy)(yit+xj). A 


Inone-variable calculus it is shown that if f is differentiable, then f is continuous. 
We will state in Theorem 8 that this is also true for differentiable functions of several 
variables. As we know, there are plenty of functions of one variable that are continuous 
but not differentiable, such as f(x) = |x|. Before stating the result, let us give an 
example of a function of two variables whose partial derivatives exist at a point, but 
which is not continuous at that point. 


I, SWSIUEUD] Let f: R? — R be defined by 


1 ifx =Oorify =0 
fœy)= h otherwise. 
Because f is constant on the x and y axes, where it equals 1, 
a a 
3f oo, 020 and fo, 0) = 0. 
Ox oy 


But f is not continuous at (0, 0), because, limit, 5 f(x, y) does not exist. A 
x, y) (0. 


Some Basic Theorems 


The first of these basic theorems relates differentiability and continuity. 


THEOREM 8 Let f: U C R” > R” be differentiable at xy € U. Then f is 
continuous at Xo. 


This result is very reasonable, because “differentiability” means that there is 
enough smoothness to have a tangent plane, which is stronger than just being contin- 
uous. Consult the Internet supplement for Chapter 2 for the formal proof. 

As we have seen, it is usually easy to tell when the partial derivatives of a function 
exist using what we know from one-variable calculus. However, the definition of dif- 
ferentiability looks somewhat complicated, and the required approximation condition 
in equation (4) may seem, and sometimes is, difficult to verify. Fortunately, there is 
a simple criterion, given in the following theorem, that tells us when a function is 
differentiable. 


THEOREM 9 Let f: U C IR" > R". Suppose the partial derivatives 3f; /3x; 
of f all exist and are continuous in a neighborhood of a point x € U. Then f is 
differentiable at x. 
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We give the proof in the Internet supplement for Chapter 2. Notice the following 
hierarchy: 


Definition 
Theorem 9 of derivative 


Continuous partials = Differentiable = Partials exist 


Each converse statement, obtained by reversing an implication, is invalid. [For a 
counterexample to the converse of the first implication, use f(x) = x? sin(1/x), 
(0) = 0; for the second, see Example | in the Internet supplement for Chapter 2 or 
use Example 4 in this section.] 

A function whose partial derivatives exist and are continuous is said to be of class 
C!. Thus, Theorem 9 says that any C! function is differentiable. 


EXAMPLE Let 


cosx + e? 


f@.y)= ae 


Show that f is differentiable at all points (x, y) z (0, 0). 
SOLUTION Observe that the partial derivatives 


af _ @ +y ye? — sinx) — 2x(cosx + e”) 
ax (x? + y2} 


af (x? +y?)xe” — 2y(cosx + e”) 
ay — + yy? 


are continuous except when x = 0 and y = 0 (by the results in Section 2.2). Thus, f 
is differentiable by Theorem 9. A 


In the Internet supplement we show that f(x, y)=xy/yx? +y? [with 
f (0, 0)=0] is continuous, has partial derivatives at (0, 0), yet is not differentiable 
there. See Figure 2.3.4. By Theorem 9, its partial derivatives cannot be continuous at 
(0, 0). 
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= 
GZ 


By Figure 2.3.4 This function is not differentiable 
Gj at (0, 0), because it is “crinkled.” 


GY 


o 


EXERCISES 
1. Find af/ax, 9f /ày if 


(a) fx, y) 2 xy 

(b) f(x, y) =e” 

(c) f(x, y) = x cosx cos y 

(d) f(x, y) = (x? + y?) log (x? + »?) 


2. Evaluate the partial derivatives 0z/dx, 0z/dy for the given function at the indicated 
points. 


(a) z= ya? — x? — y^; (0, 0), (a/2, a/2) 
(b) z = log /1 + xy; (1, 2), (0, 0) 
(c) z = e™ cos(bx + y);(27/b, 0) 


3. In each case following, find the partial derivatives dw/dx, dw/dy. 


(a) w = xet? (d) w — x/y 
i us x+y (e) w = cos ( ye”) sinx 
xy? 


(c) w = e" log (x? + y?) 


4. Show that each of the following functions is differentiable at each point in its domain. 
Decide which of the functions are C!. 


, _ 2xy " xy 
(a) f(x. y) = Ga yn (à) fG(, y) — py 
ENT. xy 
O fe) 0 fe»- urs 


(c) f(r, 0) = įr sin20,r > 0 
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5. Find the equation of the plane tangent to the surface z = x? + y? at (3, 1, 10). 


6. Using the respective functions in Exercise 1, compute the plane tangent to the graphs at 
the indicated points. 


(a) (0,0) (b) (0, 1) (c) (0, 7) (d) (0, 1) 
7. Compute the matrix of partial derivatives of the following functions: 

(a) f: R > R, f(x,y) =, y) 

(b) f: R > R, f(x, y) = (xe + cosy, x, x +e”) 

(c) f: R — R, f(x, y, z) = (x +ë + y, yx?) 

(d) f: R? > R?, f(x, y) = (xye"', x sin y, 5xy?) 


8. Compute the matrix of partial derivatives of 


(a) f(x, y) = (e, sinxy) (c) f(x y) 2 G yx — y, xy) 
(b) f(x,y,z) = (x — y, y +2) (d) f(x, y, z) = (Œ +z, y — 5z, x — y) 


9. Where does the plane tangent to z = e*~” at (1, 1, 1) meet the z axis? 


10. Why should the graphs of f(x, y) = x? + y? and g(x, y) = —x? — y? + xy? be called 
“tangent” at (0, 0)? 


11. Let f(x, y) = e”. Show that x(af/x) = y(3f/3y). 


12. Use the linear approximation to approximate a suitable function f(x, y) and thereby 
estimate the following: 


(a) (0.990292) 
(b) (0.99) + (2.01)? — 6(0.99)(2.01) 
(c) (4.01)? + (3.98)? + (2.02)? 

13. Compute the gradients of the following functions: 


(a) f(x, y, Z) = x exp(—x? — y? — z?) (Notethatexpu = e".) 
p y 


(b) f(x. y 2) = = 


SE cay 
eyez (c) f(x, y, z) 2 ze cosy 


14. Compute the tangent plane at (1, 0, 1) for each of the functions in Exercise 13. [The 
solution to part (c) only is in the Study Guide.] 


15. Find the equation of the tangent plane to z = x? + 2y? at (1, 1,3). 
16. Calculate VA(1, 1, 1) if A(x, y, z) = (x + zje. 
17. Let f(x, y, zZ) =x? + y? — z?. Calculate V/(0, 0, 1). 


18. Evaluate the gradient of f(x, y, z) = log (x? + y? + z?) at (1, 0, 1). 
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19. Describe all Hólder-continuous functions with œ > 1 (see Exercise 25, Section 2.2). 
(Hint: What is the derivative of such a function?) 


20. Suppose f: IR" — IR” is a linear map. What is the derivative of f? 


2.4 Introduction to Paths and Curves 


In this section, we introduce some of the basic geometry and computational methods 
for paths in the plane and space. This will be an important ingredient for the chain 
rule treated in the next section. We will return to paths with additional topics in 
Chapter 4. 


Paths and Curves 


One often thinks of a curve as a line drawn on paper, such as a straight line, a circle, 
or a sine curve. It is useful to think of a curve C mathematically as the set of values 
of a function that maps an interval of real numbers into the plane or space. We shall 
call such a map a path. We usually denote a path by c. The image C of the path then 
corresponds to the curve we see on paper (see Figure 2.4.1). Often we write t for the 
independent variable and imagine it to be time, so that c(t) is the position at time t of 
a moving particle, which traces out a curve as t varies. We also say c parametrizes 
C. Strictly speaking, we should distinguish between c(/) as a point in space and as a 
vector based at the origin. 


c (b) 
CV curve C = image of c 


Figure 2.4.1 The map c is the path; its 
image C is the curve we "see." 


DEWIS] The straight line L in R? through the point (xo, yo, zo) in the 
direction of vector v is the image of the path 


e(t) = (Xo, Yo, zo) + tv 


fort € R (see Figure 2.4.2). Thus, our notion of curve includes straight lines as special 
cases. A 
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c(t)— (xo Yo» Zo) + tv 


(xo; Yo» Zo) 
Figure 2.4.2 L is the straight line in space through 
(xo, Yo, zo) and in direction v; its equation is 

e(t) = (xo, yo. Zo) + tv. 


DGIB] The unit circle C: x? + y? = 1 in the plane is the image of the 
path 


c: R 5 R2, c(t) = (cos t, sin t), 0<t<2z, 


(see Figure 2.4.3). The unit circle is also the image of the path (f) = (cos 2t, sin 2t), 
0 < t < n. Thus, different paths may parametrize the same curve. A 


e(t) 
A \ Figure 2.4.3 c(t) = (cost, sint) is a path whose image C is the 
F7 id unit circle. 


Paths and Curves A path in R" is a map c: [a, b] — R"; it is a path in the 
plane if n = 2 and a path in space if n = 3. The collection C of points c(t) as t 
varies in [a, b] is called a curve, and c(a) and c(5) are its endpoints. The path c 


is said to parametrize the curve C. We also say c(t) traces out C as t varies. 

Ifc isa path in R?, we can write c(t) = (x(t), y(t), z(t) and we call x(t), y(t), 
and z(t) the component functions of c. We form component functions similarly 
in R? or, generally, in R”. 


Wie The path c(t) = (t, t?) traces out a parabolic arc. This curve coin- 
cides with the graph f(x) = x? (see Figure 2.4.4). A 
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Figure 2.4.4 The image of e(t) = (t, t?) is the 
aED (i, 1)= e(1) parabola y = x?. 


» x 


«(0) = (0, 0) 


wie} A wheel of radius R rolls to the right along a straight line at speed 
v. Use vector methods to find the path c(t) of the point on the wheel that initially lies 
at a distance r below the center. 


SOLUTION We place the wheel in the xy plane with its center initially at (0, R), 
so that the position of the center at time ¢ is given by the path C(t) = (vt, R). (Refer 
to Figure 2.4.5.) 


e C) 


I 
A C(t) 


dín) 


Ci) 


dí(4) 


a 


e(n) 


Figure 2.4.5 The vector d(/) points from the wheel’s center, C(t), to the position 
c(t) of a point on the wheel and rotates in the clockwise direction while the wheel 


moves to the right. 


The position of the point ¢(¢) relative to the center is given by the vector d(t) = 
c(t) — C(t) that has the initial value —rj and rotates in the clockwise direction. The 
rate of rotation is such that the wheel makes a full rotation after the center has moved 
a distance 2z A (equal to the circumference of the wheel). This takes a time 277R/v, 
so the angular velocity d6 /dt of the wheel is v/ R. Because the rotation is clockwise, 
the vector function d(t) is of the form 


ao =r (eos zt ! oli+sin[ a) 
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for some initial angle 0. Because d(0) = —rj, we have cos0 = 0 and sin 0 = —1, so 
0 = —7/2, and hence 


v X. sos v m]. 
ao =r (cos[ R! 5 ji+ sin [ R! sli): 
Using cos (p — 7/2) = sin q and sin (p — 1/2) = — cos 9, along with cos (—g) = 
cos 9 and sin(—9) = —sin g, we get 


t t 
dt) =r (-sin i — cos ai). 


Finally, the path e(¢) is given by adding the components of the vector function d(/) to 
the coordinates of the path C(t); the result is 


vt vt 
[3 t i ,R š 
e(t) (» r sin R rcos x) 


In the special case v = R =r = 1, we get c(t) = (t — sint, 1 — cos t). The image 
curve C of this path c is shown in Figure 2.4.6; it is called a cycloid. A 


c(t) -(t—sint, 1—cost) 


Figure 2.4.6 The curve traced by a 
point moving on the rim of a rolling 
circle is called a cycloid. 


The preceding example considered the path of a point not necessarily on the rim 
ofa wheel rolling along a straight line. When the wheel rolls on a circle, the resulting 
curve is called an epicycle. These are the epicycles discussed in the Ptolemaic theory 
in the introduction. If the wheel is outside the circle and the point is on the rim, the 
curve is called an epicycloid, and when the wheel is inside the circle it is a hypocycloid. 
An example of the latter is shown in Figure 2.4.7. 


Figure 2.4.7 An example of a hypocycloid. 
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Velocity and Tangents to Paths 


If we think of ¢(t) as the curve traced out by a particle and ¢ as time, it is reasonable 
to define the velocity vector as follows. 


DEFINITION: Velocity Vector Ifc is a path and it is differentiable, we say 
c is a differentiable path. The velocity of c at time t is defined by? 


ét) = jim COO. 


We normally draw the vector c'(f) with its tail at the point c(t). The speed of 
the path e(t) is s = ||e’(t)||, the length of the velocity vector. If c(t) = (x(t), y(t)) 
in R2, then 


e'(t) = E), y'(0)) = ¥'Oi + YOK 


and if c(t) = (x(t), y(t), z(t)) in R?, then 


e(t) = K'A), y' (0), z/() = x (Oo y' (Oi + z' GO). 


Here, x’(t) is the one-variable derivative dx /dt. If we accept limits of vectors inter- 
preted componentwise, the formulas for the velocity vector follow from the definition 
of the derivative. However, the limit can be interpreted in the sense of vectors as well. 
In Figure 2.4.8, we see that [c(t + A) — c(t)]/ h approaches the tangent to the path as 
h 0. 


?If t lies at the endpoint of an interval, one should, as in one-variable calculus, take right- or left-handed limits. 
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e'() 


elt +h) — e(r) 


Figure 2.4.8 The vector c'(t) is 
tangent to the path c(t). 


Tangent Vector The velocity c'(t) is a vector tangent to the path c(t) at time 


t. If C isa curve traced out by c and if c'(t) is not equal to 0, then c'(/) is a vector 
tangent to the curve C at the point e(t). 


If we think of the derivative De(t) as a matrix, it will be a column vector with the 
entries x'(t), y'(t), and z’(t). Thus, the derivative here is consistent with our earlier 
notion. 


DGE Compute the tangent vector to the path e(t) = (t, t, e')att =0. 


SOLUTION Here c'(t) = (1, 2t, e'), and so at t = 0 we obtain the tangent vector 
(1,0,1). A 


DONAA Describe the path c(t) = (cos f, sint, t). Find the velocity vector 


at the point on the image curve where t = 7/2. 


SOLUTION Fora given t, the point (cos f, sin t, 0) lies on the circle x? + y? = 1 
in the xy plane. Therefore, the point (cost, sint, ¢) lies ¢ units above the point 
(cos t, sin t, 0) if ¢ is positive and —/ units below (cos t, sin‘, 0) if t is negative. As 
t increases, (cos f, sint, t) wraps around the cylinder x? + y? = 1 with the z coordi- 
nate increasing. The curve this traces out is called a helix, which is depicted in Figure 
2.4.9. At t = 2/2, €'(1/2) = (—sinz/2, cosz/2, 1) 2 (-1,0,1) 2 —i--K. A 


PWIA Be The cycloidal path of a particle on the edge of a wheel of radius 
R with velocity v is given by c(t) = (vt — Rsin(vt/ R), R — R cos (vt /R)). (See 
Example 4.) Find the velocity c'(t) of the particle as a function of t. When is the 
velocity zero? Is the velocity vector ever vertical? 
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c (1/2) =-i+k 
(cos t, sin t. £) Figure 2.4.9 The helix c(t) = (cos f, sint, t) 
y wraps around the cylinder x? + y? = 1. 


(cos t, sin t, 0) 


SOLUTION To find the velocity, we differentiate: 


d t\ d t 
e'(r) — E c asi. 2; (^ R cos xJ) 
u vt — , vt 
= (v7 veos p vsin = J. 


In vector notation, c'(f) = (v — v cos (vt/ R))i + (v sin (vt/ R))j. The component in 
the direction of i is v(1 — cos(vt/ R)), which is zero whenever vt/R is an integer 
multiple of 27r. For such values of t, sin (vt/ R) is zero as well, so the only times at 
which the velocity is zero are when t = 27nR/v for some integer n. At such times, 
c(t) = (27nR, 0), so the moving point is touching the ground. These moments occur 
at time intervals of 2z:R/v (more frequently for small wheels, as well as for rapidly 
rolling ones). 

The velocity vector is never vertical, because the horizontal component vanishes 
only when the vertical one does as well. A 


Figure 2.4.10 shows some velocity vectors superimposed on the cycloidal path of 
Figure 2.4.6. 


Figure 2.4.10 Velocity vectors for the curve 
traced out by a point on the rim of a rolling wheel. 
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Tangent Line 


The tangent line to a path at a point is the line through the point in the direction 
of the tangent vector. Using the point-direction form of the equation of a line, we 
obtain the parametric equation for the tangent line. 


Tangent Line to a Path If c(t) is a path, and if c'(to) # 0, the equation of its 
tangent line at the point c(fo) is 


Kt) = e(to) + (t — to)e (to). 


If C is the curve traced out by c, then the line traced out by / is the tangent line 
to the curve C at c(fo). 


Notice that we have written the equation in such a way that / goes through the 
point e(fo) at £ = fo (rather than ¢ = 0). See Figure 2.4.11. 


Figure 2.4.11 The tangent line to a path. 


ID CWZIQEDE] A path in R? goes through the point (3, 6, 5) at  — 0 with tangent 
vector i — j. Find the equation of the tangent line. 


SOLUTION The equation of the tangent line is 


Kt) = (3, 6, 5) + t(i— j) = (3, 6, 5)+ t(1, —1, 0) = (8756—1,5). 
In (x, y, z) coordinates, the tangent line is x = 3 +t, y =6—t,z=5. A 


Physically, we can interpret motion along the tangent line as the path that a 
particle on a curve would follow if it were set free at a certain moment. 


: WIJS] Suppose that a particle follows the path c(t) = (e', e™', cos t) until 
it flies off on a tangent at t = 1. Where is it at t = 3? 
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SOLUTION The velocity vector is (e', —e~', —sin t), which at £ = 1 is the vector 
(e, —1/e, —sin 1). The particle is at (e, 1/e, cos 1) att = 1. The equation ofthe tangent 
line is (t) = (e, 1/e, cos 1) + (t — 1)(e, —1/e, —sin 1). At t = 3, the position on this 
line is 


1 
K3) = (e. — 1) +2 (e —, —sin )) ES (ze i cos l — 2 sin )) 
e e e 


= (8.155, —0.368, —1.143). A 


EXERCISES 


Sketch the curves that are the images of the paths in Exercises 1 to 4. 
1. x = sint, y = 4cost, where 0 < t < 27 
2. x =2sint, y = 4cost, where 0 < t < 27 
3. c(t) 2 Qt — 1l, t -2, t) 
4. c(t) = (—t, 2t, 1/t), where 1 < t € 3 

In Exercises 5 to 8, determine the velocity vector of the given path. 
5. c(t) = 6ti 4- 3°) +k 
6. c(t) = (sin 3t)i + (cos 34)j + 27k 
7. r(t) = (cos t, 3t — C, t) 
8. r(t) = (4e', 6^, cost) 

In Exercises 9 to 12, compute the tangent vector to the given path. 
9. c(t) = (e', cost) 

10. e(t) = (37, P) 

11. c(t) = (t sint, 4t) 

12. c(t) = (2, e) 


13. When is the velocity vector of a point on the rim of a rolling wheel horizontal? What is 
the speed at this point? 


14. If the position of a particle in space is (6t, 317, t°) at time t, what is its velocity vector at 
t=0? 


150 Differentiation 


In Exercises 15 and 16, determine the equation of the tangent line to the given path at the 
specified value of t. 


15. (sin3t, cos3t, 2052); = 1 
16. (cos? t, 3t — 0, t);t — 0 


In Exercises 17 to 20, suppose that a particle following the given path c(t) flies off on a 
tangent at t = tg. Compute the position of the particle at the given time ty. 


17. c(t) = (?, ? — 4t, 0), where to = 2, t =3 
18. c(t) = (e', e™', cost), where f = 1,5 = 2 
19. c(t) = (4e', 61^, cost), where f = 0, ti = 1 


20. c(t) = (sine', 1,4 — t’), where to = 1, ti =2 
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In elementary calculus, we learn how to differentiate sums, products, quotients, and 
composite functions. We now generalize these ideas to functions of several variables, 
paying particular attention to the differentiation of composite functions. The rule for 
differentiating composites, called the chain rule, takes on a more profound form for 
functions of several variables than for those of one variable. 

If f is a real-valued function of one variable, written as z = f(y), and y is a 
function of x, written y = g(x), then z becomes a function of x through substitution, 
namely, z = f(g(x)), and we have the familiar chain rule: 


=> -f(gx)yg). 


If f is a real-valued function of three variables u, v, and w, written in the form 
z = f(u, v, w), and the variables u, v, w are each functions of x, u = g(x), v = A(x), 
and w = k(x), then by substituting g(x), A(x), and k(x) for u, v, and w, we obtain z 
as a function of x: z = f(g(x), h(x), k(x)). The chain rule in this case reads: 


dz | 0üz du P əz dv | dz dw 
dx ðudx dvdx | dw dx’ 


One of the goals of this section is to explain such formulas in detail. 
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Sums, Products, Quotients 


These rules work just as they do in one-variable calculus. 


THEOREM 10: Sums, Products, Quotients 
(i) Constant Multiple Rule. Let f: U C R” — R” be differentiable at xo 
and let c be a real number. Then A(x) = cf (x) is differentiable at xp and 


DA(xo) = cDf(xo) (equality of matrices). 


(ii) Sum Rule. Let f: U C R” — R” and g: U C IR" — R” be differentiable 
at xo. Then h(x) = f(x) + g(x) is differentiable at x; and 


Dh(xo) = Df (x0) + Dg(xo) (sum of matrices). 


(iii) Product Rule. Let f: U C R” — Randg: U C R” — R be differentiable 
at xo and let h(x) = g(x) f (x). Then ^: U C R” — R is differentiable at xo 
and 


DA(xo) = g(xo)Df (xo) + f (xo)Dg(xo). 
(Note that each side of this equation is a 1 x n matrix; a more general 
product rule is presented in Exercise 29 at the end of this section.) 


(iv) Quotient Rule. With the same hypotheses as in rule (iii), let h(x) = 
f(x)/g(x) and suppose g is never zero on U. Then A is differentiable at 
xo and 


£(xo)D/ (xo) — f/(xo)Dg(xo) 
[g(xo)? : 


DA (xo) = 


PROOF The proofs of rules (i) through (iv) proceed almost exactly as in the one- 
variable case with a slight difference in notation. We shall prove rules (i) and (ii), 
leaving the proofs of rules (iii) and (iv) as Exercise 25. 


(i) To show that D/(xo) = cDf(xo), we must show that 


AQ) — Ao) — cDf(xo)x — Xo) I] _ 


E Ix — xol " 
that is, that 
imit WL) — ef 0) — Do — x0) _ 9 


X3) lix — xoll 
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[see equation (4) of Section 2.3]. This is certainly true, since f is differentiable 
and the constant c can be factored out [see Theorem 3(i), Section 2.2]. 


(ii) By the triangle inequality, we may write 


lA) — (xo) — [Df (xo) + Dg(xo)l(x — xo)ll 


lix — xoll 
IF) — f (xo) — [D (xo)(x — xo) + g(x) — (xo) — [Dg(xo)lx — xo)ll 
= Ix — xoll 
1/69 = fo) = [Df CoN — xo)Il | llgG) — glo) — Dgo] — xol 
~ Ix — xoll Ix — xoll ' 


and each term approaches 0 as x — xo. Hence, rule (ii) holds. m 
Verify the formula for Dh in rule (iv) of Theorem 10 with 
fŒ, yz) =x? + y? +27 and g(x, y,z) 2 x? 1. 
SOLUTION Here 


x? + y? +2? 


h(x, yi 


, 


so that by direct differentiation 


ðh dh dh (x? + 1)2x — (x? +y? +27)2x 2y 2z 
ax’ dy’ dz (x? + 1)? "x2 41 x241 
2x(l—y?—z*) 2y 22 

(2412 7x2 417 x? 41) 


DA(x, y, z) [ 


By rule (iv), we get 


gDf — fDg _ Œ? + I)[2x, 2y, 2z] — (x? + y? + z°)[2x, 0, 0] 
g (x? 4 1) ý 


Dh 


which is the same as what we obtained directly. A 


Chain Rule 


As we mentioned earlier, it is in the differentiation of composite functions that we 
meet apparently substantial alterations of the formula from one-variable calculus. 
However, if we use the D notation, that is, matrix notation for derivatives, the chain 
rule for functions of several variables looks similar to the one-variable rule. 
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THEOREM 11: Chain Rule Let U C R” and V C R” be open sets. Let 
g: UCR" > R" and f: V C R" — R? be given functions such that g maps 
U into V, so that f o g is defined. Suppose g is differentiable at xp and f is 
differentiable at yo = g(xo). Then f o g is differentiable at xo and 


D(f o g)(xo) = Df(yo)Dg(xo). a) 


The right-hand side is the matrix product of D f (yọ) with Dg(xo). 


We shall now give a proof of the chain rule under the additional assumption 
that the partial derivatives of f are continuous, building up to the general case by 
developing two special cases that are themselves important. (The complete proof of 
Theorem 11 without the additional assumption of continuity is given in the Internet 
supplement for Chapter 2.) 


First Special Case of the Chain Rule 


Suppose c: IR — R? is a differentiable path and f: R? > R. Let A(t) = f(e(t)) = 
SEH), y). z()). where e(t) = (x(t), y(t), 2(). Then 


dh  Ofdx | ðf dy n af dz Q) 
dt  Oxdt ` ðydt dzdt 
That is, 
dh 


— =V t))- c(t), 
CORO) 
where e'(t) = (x'(t), y'(t), z'(0)). 
This is the special case of Theorem 11 in which we take c = g and f to be 
real-valued, and m — 3. Notice that 


Vf (c(t) - c(t) = Df (eet), 


where the product on the left-hand side is the dot product of vectors, while the product 
on the right-hand side is matrix multiplication, and where we regard Df (c(t)) as a row 
matrix and De(t) as a column matrix. The vectors V/f(e(t)) and c'(t) have the same 
components as their matrix equivalents; the notational change indicates the switch 
from matrices to vectors. 


PROOF OF EQUATION (2). By definition, 


h(t) — h(to) 
t—t C 


df oy eid 
dt (to) = i 
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Adding and subtracting two terms, we write 
h(t) — h(to) — F&M, YO, 2) — F&C), (to), z(to)) 
t— t — to 
_ Gt), yH), 2) — F&M), YO, 2) 
t — to 
+ LX (to), YO), z(t)) — fGx(to), V(to), 2) 
1— to 
4 L(X(to), y(to), z(0) — fGx(to), y(to), z(to)) 
t — tg ` 
Now we invoke the mean-value theorem from one-variable calculus, which states: 
If g: [a, b] — R is continuous and is differentiable on the open interval (a, b), then 
there is a point c in (a, b) such that g(b) — g(a) = g'(c)(b — a). Applying this to f 
as a function of x, we can assert that for some c between x and xo, 


I(x, yz) — fo, y. z) = E »aJe — xo). 


In this way, we find that 
Kos jg a = KORN +[¥ -— «|? X — Y) 
— to to — fo 
*[Zeto. xt). o 9 69. 


where c, d, and e lie between x(t) and x (to), between y(t) and y(to), and between z(t) 
and z(fo), respectively. Taking the limit / — tọ, using the continuity of the partials 
af/ax, af/dy, 0f/0z, and the fact that c, d, and e converge to x(to), y(to), and z(to), 
respectively, we obtain formula (2). m 


Second Special Case of the Chain Rule 
Let f: R? — Rand let g: IR? > R?. Write 
g(x, y, z) = (U(X, y, z), vx, y, z), W(X, y, z)) 
and define h: IR? — R by setting 
h(x, y, z) = fu, y, z), v(x, y, z), w(x, y, z)). 


In this case, the chain rule states that 


ðu ðu du 

ax dy az 
E oh s l-[z af a ðv ðv av 6) 
dx dy dz ðu ðv dw] ax ay az 
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In this special case, we have taken n — m — 3 and p — 1 for concreteness, 
and U — R? and V — R? for simplicity, and have written out the matrix prod- 
uct [D/(yo)][Dg(xo)] explicitly (with the arguments xo and yo suppressed in the 
matrices). 


PROOF OF THE SECOND SPECIAL CASE OF THE CHAIN RULE. By 
definition, 0h/dx is obtained by differentiating A with respect to x, holding y and z 
fixed. But then (u(x, y, z), v(x, y, z), w(x, y, z)) may be regarded as a vector function 
of the single variable x. The first special case applies to this situation and, after the 
variables are renamed, gives 


ah af au af dv , af aw 


Ox ^ au dx * av ax * Bw Ox G) 
Similarly, 
ðh af du Əfəðv = Af dw G”) 
ðy ðu ðy dvdy dw day 
and 
9h af du , af dv , af dw Q") 


ðz Ou dz ðvəðz ðw ðz` 


These equations are exactly what would be obtained by multiplying out the matrices 
in equation (3). m 


PROOF OF THEOREM 11. The general case in equation (1) may be proved in 
two steps. First, equation (2) is generalized to m variables; that is, for f (x1, . ~- , Xm) 
and e(t) = (xı (t), .. - , Xm(t)), one has 


where A(t) = f (x(t), ..., Xm(t)). Second, the result obtained in the first step is used 
to obtain the formula 


àh; 3 af; dyr 


ax; £ Ay, xi 
where f = (fi, ..., fp) isa vector function ofarguments yi, ..., ys; g(xi, - <- , Xn) = 
WE - 22s X2) ess VQ, -- X) and hj(X1, .. -3 Xn) = fjnQas s. Xn) es 
Yn Qu; - - - , Xn)). (Using the letter y for both functions and arguments is an abuse of 


notation, but it can help one remember the formula.) This formula is equivalent to 
formula (1) after the matrices are multiplied out. 8 
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c(i) 


The path e(t) 


Differentiation 


The pattern of the chain rule will become clear once the student has worked some 
additional examples. For instance, 


a ðf ðu afdv adafdw  daf dz 
gx FU) v6 3) WHY 20 Y= aay + ay ax + Bw ax * Oz Ox" 
with a similar formula for 0f/dy. 

The chain rule can help us understand the relationship between the geometry of a 
mapping f: R? — R? and the geometry of curves in R?. (Similar statements may be 
made about R? or, generally, IR”.) If c(t) is a path in the plane, then as we saw in Section 
2.4, c'(t) represents the tangent (or velocity) vector of the path c(t), and this tangent 
(or velocity) vector is thought of as beginning at e(t). Now let p(t) = f(e(t)), where 
f: R? — R?, The path p represents the image of the path c(/) under the mapping f. 
The tangent vector to p is given by the chain rule: 


multiplication 


p'() 2 Df(c()) eo). 
L 


matrix column 
vector 


In other words, the derivative matrix of f maps the tangent (or velocity) vector of 
a path c to the tangent (or velocity) vector of the corresponding image path p (see 
Figure 2.5.1). Thus, points are mapped by f, while tangent vectors to curves are 
mapped by the derivative of f, evaluated at the base point of the tangent vector in 
the domain. 


p) = Dfic())e (n 


Figure 2.5.1 Tangent vectors 
are mapped by the derivative 
matrix. 


p(?) is the image of e(r) under f 


[SWISS] Verify the chain rule in the form of formula (3^) for 


S(u, v, w) = i? dw -w, 


where 


u(x, y, Z) = x?y, v(x, y, z) = y), w(x, y,z)=e*. 
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SOLUTION Here 
h(x, y, z) = f(x, y, z), vx, y, z), WO, y, z)) 
= (yy tyt er a xty? y 5e. 
Thus, differentiating directly, 


ðh 
— =4x3y* + ze. 
Ox 


On the other hand, using the chain rule, 


dh = af du Əfəðv Af aw 
ðx dudx Ovx dw Ax 


= (2x? y)(2xy) + ze^*, 


2u(2xy) + 2v- 0 + (—1)(—ze™*”) 


which is the same as the preceding equation. A 


Wow Given g(x, y) = (x? + 1, y?) and f(u, v) = (u + v, u, v?), com- 
pute the derivative of f o g at the point (x, y) = (1, 1) using the chain rule. 


SOLUTION The matrices of partial derivatives are 


afi aft 
du ðv "P 

Df, v= |? ah -f | and Pets.» = [o at 
ðu ðv 0 2v 0 2y 
8f 8f 
ðu ðv 


When (x, y) = (1, 1), note that g(x, y) = (u, v) = (2, 1). Hence, 


Lud 2 0 2 
D(f o gX1, 1) = D/Q, )Dg(1,1)— |1 0 F 3 20 
02 


N 


0 4 


is the required derivative. A 


DGIE] Let f(x, y) be given and make the substitution x = r cos0, y = 
r sin@ (polar coordinates). Write a formula for 9/90. 


SOLUTION By the chain rule, 


af _Əf ax | af dy 
80 0x00 «Ay 0 
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that is, 


a a a 
M = r sino oe +r eos 


vow 


Let f(x, y) = (cos y + x?, e*t») and g(u, v) = (e° , u — sin v). 
(a) Write a formula for f o g. (b) Calculate D( f o g)(0, 0) using the chain rule. 


SOLUTION (a) We have 


(f o g)u, v) = (eu — sin v) 
2u? e^ +u-sin 2 


= (cos (u — sinv) + e?" , e 
(b) By the chain rule, 
D(f o g)(0, 0) = [D/(g(0, 0))][Dg(0, 0)] = [Df(1, 0)][Dg(0. 0)]. 


Now 


Que 0 0 0 
wao 2] =p 5] 
1 —cos v (uv)=(0.0) 1 1 


and 


_ | 2x —siny _ {2 0 
»ra.9 - [2 guy . an le I 


[Remember that Df is evaluated at g(0, 0), not at (0, 0)!] Thus, 


Dy «so. = | SE -E :] a 


Let f: U C R” — R” be differentiable, with f = (fi,..., fm) 
and let g(x) = sin [ f (x) - /(x)]. Compute Dg(x). 

SOLUTION By the chain rule, Dg(x) = cos [ f (x) + f(x)]DA(x), where h(x) = 
Lf): f ()] = fe(x) +--+ + f(x). Then 
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which can be written 2 f (x)Df (x), where we regard f as a row matrix, 


af 
Ox, 
f= Tf c fn] and  Df-| : 
fn 
Oxy 


Thus, Dg(x) = 2[cos /(x) FASADE). & 


EXERCISES 


afi 


Ox, 


" 


0x, 
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1. If f: U C R” — R is differentiable, prove that x  /?(x) + 2 (x) is differentiable as 


well, and compute its derivative in terms of Df (x). 


2. Prove that the following functions are differentiable, and find their derivatives at an 


arbitrary point: 


(a) f:R?^—-R(xye2 

(b) f: R >R, œ, y) e x+y 
(c) f: R >R, (x,y) 2+x+y 
(d) f: R > R, (x,y) x? +y? 
(e) f: R > R, (x, y) 5 e" 


(f) f: U > R, (x, y) 5 /1— x? — y?, where U = ((x, y) | x + y? < 1} 


(g f: R > R, (x,y) x*- y 


3. Write out the chain rule for each of the following functions and justify your answer in 


each case using Theorem 11. 


(a) ðh/ð3x where h(x, y) = f(x, u(x, y) 
(b) dh/dx where h(x) = f(x, u(x), v(x)) 


(c) dh/ax where h(x, y, z) = f(u(x, y, z), v(x, y), w(x)) 


4. Verify the chain rule for 3h /ðx, where h(x, y) = f(u(x, y), v(x, y)) and 


u? +o? 


flu, v) = EnA u(x, y) = e™7, u(x, y) = e". 


5. Verify the first special case of the chain rule for the composition f o c in each of the 


cases: 


(a) f(x. y) = xy, e(t) = (e', cost) 

(b) f(x, y) =e”, e(t) = BP, t) 

(c) fGx, y) = (x? +?) log x? + y?, elt) = (e', e~) 
(d) f(x, y) = x exp(x? + y^), e(t) = (t, —t) 


6. What is the velocity vector for each path c(t) in Exercise 5? [The solution to part (b) only 


is in the Study Guide to this text.] 
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7. Let f: R? > Rand g: R? > R be differentiable. Prove that 


V(fg) = f Va  gVf. 
8. Let f: R? > R be differentiable. Making the substitution 
x =pcos@sing, y =psiné sing, z = pcosġ 


(spherical coordinates) into f(x, y, z), compute 3f/3p, 8/80, and 3f/ð¢ğ in terms of 
9f /üx, 9f/8y, and af/adz. 


9. Let f(u, v) = (tan (u — 1) — e", u? — v?) and g(x, y) = (e*, x — y). Calculate f o g 
and D(f o g)(1, 1). 


10. Let f(u, v, w) = (e"^", cos(v + u) + sin(u + v + w)) and g(x, y) = 
(e*, cos( y — x), e). Calculate f o g and D(f o g)(0, 0). 


11. Find (0/8s)(f o T)(1, 0), where f(u, v) = cos u sin v and T: IR? — R? is defined by 
T(s, t) = (cos (t?s), log V1 + s?). 

12. Suppose that the temperature at the point (x, y, z) in space is T(x, y, z) = x? + y? - z^. 
Let a particle follow the right-circular helix o(t) = (cos f, sin f, t) and let T(t) be its 
temperature at time f. 


(a) What is 7"(r)? 
(b) Find an approximate value for the temperature at t = (2/2) + 0.01. 


13. Suppose that a duck is swimming in the circle x = cost, y = sin? and that the water 
temperature is given by the formula T = x?e” — xy?. Find d T /dt, the rate of change in 
temperature the duck might feel: (a) by the chain rule; (b) by expressing T in terms of f and 
differentiating. 


14. Let f: IR" — IR" be a linear mapping so that (by Exercise 20, Section 2.3) D/(x) is the 
matrix of f. Check the validity of the chain rule directly for linear mappings. 


15. Let f: R? — R^; (x, y) e» (e*t, e"). Let e(t) be a path with e(0) = (0, 0) and 
€'(0) = (1, 1). What is the tangent vector to the image of c(t) under f at ¢ = 0? 


16. Let f(x, y) = l//x? + y?. Compute Vf(x, y). 


17. (a) Let y(x) be defined implicitly by G(x, y(x)) = 0, where G is a given function of two 
variables. Prove that if y(x) and G are differentiable, then 


dy | 8Gfàx  , 8G 


E e gi. 
dx ^ Gfày | ày^ 


(b) Obtain a formula analogous to that in part (a) if yı, y2 are defined implicitly by 
Gi(x, yix), y2(x)) = 0, 
G(x, yi(x), yo(x)) = 0. 
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(c) Let y be defined implicitly by 
xy reo. 
Compute dy/dx in terms of x and y. 


18. Thermodynamics texts* use the relationship 


dy \ (dz) (ax\ — 1 

ax)\ay)/\azJ | 
Explain the meaning of this equation and prove that it is true. [HINT: Start with a relationship 
F(x, y, z) = 0 that defines x = f(y, z), y = g(x, z), and z = h(x, y) and differentiate 
implicitly.] 
19. Dieterici’s equation of state for a gas is 


P(V — bye" "T = RT, 


where a, b, and R are constants. Regard volume V as a function of temperature T and 


pressure P and prove that 
aV R+ a RT a 
aT TV V-b Wy 


20. This exercise gives another example of the fact that the chain rule is not applicable if f is 
not differentiable. Consider the function 


xy? 
finie Ex (x, y) # (0, 0) 


0 (x, y) = (0, 0). 


Show that 


(a) df/ax and af/dy exist at (0, 0). 
(b) If g(t) = (at, bt) for constants a and b, then f o g is differentiable and ( f o g)'(0) = 
ab? /(a? + b^), but Vf(0, 0) + g'(0) = 0. 


21. Prove that if f: U C IR" — R is differentiable at xp € U, there is a neighborhood V of 
0 € R” and a function R: V — R such that for all h € V, we have xo +h € U, 


(xo + h) = f(xo) + [Df (xo)]h + Ri(h) 


4See S. M. Binder, “Mathematical Methods in Elementary Thermodynamics,” J. Chem. Educ. 43 (1966): 85-92. A 
proper understanding of partial differentiation can be of significant use in applications; for example, see M. Feinberg, 
“Constitutive Equation for Ideal Gas Mixtures and Ideal Solutions as Consequences of Simple Postulates,” Chem. Eng. 
Sci. 32 (1977): 75-78. 
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and 


Rh) 


0 as h— 0. 
Ihi 


22. Suppose xo € IR" and 0 < rı < r2. Show that there is a C! function f: R” — IR such 
that f(x) = 0 for |x — xoll > 72:0 < f(x) < 1 forr < ||x — xoll < r2; and f(x) = 1 for 
lx — xol| < ri. [Hint: Apply a cubic polynomial with g(r?) = 1 and g(r2) = g'(r2) = 
g'(r2) = 0to [x — xol? when ri < ||x — xoll < r2.] 


23. Finda C! mapping f: R? — R? that takes the vector i + j + k emanating from the 
origin to i — j emanating from (1, 1, 0) and takes k emanating from (1, 1, 0) tok — i 
emanating from the origin. 


24. What is wrong with the following argument? Suppose w = f(x, y, z) and z = g(x, y). 
By the chain rule, 


dw dwdadx dw dy & ðwðz aw 5 ðw dz 
ðx  ðxðx Ay Ox Oz 0x  ðx  əðzəðx` 


Hence, 0 = (8w/8z)(8z/8x), and so dw/dz = 0 or 4z/Ax = 0, which is, in general, 
absurd. 


25. Prove rules (iii) and (iv) of Theorem 10. (HNT: Use the same addition and subtraction 
tricks as in the one-variable case and Theorem 8.) 


26. Show that h: R” — R” is differentiable if and only if each of the m components 
hi: R” — R is differentiable. (HT: Use the coordinate projection function and the chain 
tule for one implication and consider 


L — h(xo) — DA(xo)(x — 24 Mah) — hi (Xo)Dhi(Xo)(X — xor 


lix — xoll Ix — xoll? 
to obtain the other.) 


27. Use the chain rule and differentiation under the integral sign, namely, 


d [^ ^ gf 
=f Fes. yydy = f ax END 
to show that 
d [* * gf 
a; [| fe tro re [ Lena. 
28. For what integers p > 0 is 
x? sin(I/x) x X0 
x= 


fe) = k : 


differentiable? For what p is the derivative continuous? 
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29. Suppose f: IR" — IR and g: IR" — IR" are differentiable. Show that the product function 
A(x) = f(x)g(x) from R” to R” is differentiable and that if xy and y are in IR", then 
[DA (xo)]y = /(xo)([Dg(xo)Iy) + ([Df(xo)ly)g(xo). 


2.6 Gradients and Directional Derivatives 


In Section 2.1 we studied the graphs of real-valued functions. Now we take up this 
study again, using the methods of calculus. Specifically, gradients will be used to 
obtain a formula for the plane tangent to a level surface. 


Gradients in R? 


Let us recall the definition. 


DEFINITION: The Gradient If f: U C R? > Ris differentiable, the gra- 
dient of f at (x, y, z) is the vector in space given by 


v= (8.95 9n) 
~ ax’ ay’ az) 


This vector is also denoted V f(x, y, z). Thus, Vf is just the matrix of the deriva- 
tive Df, written as a vector. 


DVA Let f(x,y,z) — yx? + y? +z? =r, the distance from 0 to 


(x, y, z). Then 


viu) 


VG y2) = (a. ay’ az 


M ( x y Zz ) Ë 
Ve+y4+2 Sx +y +22’ Yx tya r’ 
where ris the point (x, y, z). Thus, V f is the unit vector in the directionof(x, y, z). A 


DON vinwowe If f(x, y, z) = xy + z, then 


VfG yz) = (Z. u, z) =(y,x,1). A 
x dy Oz 
Suppose f: R? — R isa real-valued function. Let v and x € R? be fixed vectors 
and consider the function from R to R defined by t +> f(x + tv). The set of points 
of the form x + tv, t € R, is the line L through the point x parallel to the vector v 
(see Figure 2.6.1). 
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tv translated 


Figure 2.6.1 The equation of L is (t) = x + tv. 


Directional Derivatives 


The function t +> f(x + tv) represents the function f restricted to the line L. For 
example, if a bird flies along this line with velocity v so that x + fv is its position at 
time t, and if f represents the temperature as a function of position, then f(x + tv) 
is the temperature at time /. We may ask: How fast are the values of f changing 
along the line L at the point x? Because the rate of change of a function is given by a 
derivative, we could say that the answer to this question is the value of the derivative 
of this function of t at t = 0 (when t = 0, x + tv reduces to x). This would be the 
derivative of f at the point x in the direction of L, that is, of v. We can formalize this 
concept as follows. 


DEFINITION: Directional Derivatives If f: R? > R, the directional 
derivative of f at x along the vector v is given by 


d 
ani + tv) i 


if this exists. 


In the definition of a directional derivative, we normally choose v to be a 
unit vector. In this case we are moving in the direction v with unit speed and we 
refer to 4 f(x + tv)|;-o as the directional derivative of f in the direction v. 


We now elaborate on why a unit vector is chosen in the definition ofthe directional 
derivative. Suppose that f measures the temperature in degrees and that we are 
interested in how fast the temperature changes as we move in a particular direction. 
If we are measuring distance in meters, then the rate of change of temperature will 
be measured in degrees per meter. Suppose, for simplicity, that the temperature is 
changing at a constant rate—say, two degrees per meter—as we move in a given 
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direction v starting at x. Thus, when we go one meter ahead, the temperature changes 
by two degrees. That is, 


f(x v)- f(x) =2 


Such a relation is going to hold only when v is a unit vector, reflecting the fact that 
we are going ahead by one meter. More generally, the definition of the directional 
derivative is going to truly measure only the rate of change of f with respect to 
distance along a line in a given direction if v is a unit vector. 

From the definition, we can see that the directional derivative can also be defined 
by the formula 

time G8 7 FO) 
0 h 

THEOREM 12 If f: R? — R is differentiable, then all directional derivatives 
exist. The directional derivative at x in the direction v is given by 


D/(x)v = grad f(x): v = Vf(x)- v = Z w " E e) - [2 e| ii 
x ay az 


where v = (vi, v2, v3). 


PROOF Let c(t) = x + tv, so that f(x + tv) = f(c(t)). By the first special case 
of the chain rule, (d/dt) f(c(t)) = V.f(c(t)) - €'(t). However, c(0) = x and e'(0) = v, 
and so 


L fat) T" = Vf(x)- v, 


as we were required to prove. W 


Notice that one does not have to use straight lines when computing the rate of 
change of f in a specific direction v. Indeed, for a general path c(t) with c(0) = x and 
€'(0) = v, we have from the chain rule, 


d , 
£ pew) = vreo) = Vf(3)- v. 
t 1-0 1-0 
SENIDE] Let f(x, y, z) = x?e?". Compute the rate of change of f in the 


direction of the unit vector 


b ol 


v= (= A) at the point (1, 0, 0). 
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SOLUTION The required rate of change is, using Theorem 12, 
yz (22077 2 yp Vz 1 | 1 
Vf -v = Qxe **, —x^ze *, -x*ye ??). | —, ~, — ]. 


which, at the point (1, 0, 0), becomes 


one (dede) * 


Directions of Fastest Increase 
From Theorem 12 we can also obtain the geometrical significance of the gradient: 


THEOREM 13 Assume Vf(x) z 0. Then V/(x) points in the direction along 
which f is increasing the fastest. 


PROOF If n is a unit vector, the rate of change of f in direction n is given 
by Vf(x)-n = ||Vf(x)| cos0, where 0 is the angle between n and V/(x). This is 
maximum when 0 = 0; that is, when n and V f are parallel. [If V f(x) = 0 this rate of 
change is 0 for any n.) $m 


In other words, if one wishes to move in a direction in which f will increase most 
quickly, one should proceed in the direction V f (x). Analogously, if one wishes to move 
in a direction in which f decreases the fastest, one should proceed in the direction 


—Vf(x). 


AA In what direction from (0, 1) does f(x, y) = x? — y? increase the 
fastest? 


SOLUTION The gradient is 

Vf = 2xi—2y], 
and so at (0, 1) this is 

Vf lo. = —2i- 


By Theorem 13, f increases fastest in the direction —j. (Can you see why this answer 
is consistent with Figure 2.1.9?) 4 


Gradients and Tangent Planes to Level Sets 


Now we find the relationship between the gradient of a function f and its level 
surfaces. The gradient points in the direction in which the values of f change most 
rapidly, whereas a level surface lies in the directions in which they do not change 
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at all. If f is reasonably well behaved, the gradient and the level surface will be 
perpendicular. 


THEOREM 14: The Gradient is Normal to Level Surfaces Let 
f: R? > R be a C! map and let (xo, yo, zo) lie on the level surface S defined 
by f(x, y, z) =k, for k a constant. Then Vf (xo, yo, zo) is normal to the level 
surface in the following sense: If v is the tangent vector at ¢ = 0 of a path c(t) in 
S with c(0) = (xo, yo, zo), then Vf(xo, yo, zo) * V = 0 (see Figure 2.6.2). 


VfGxo, Yo Zo) 


Vf Gros Yo» Zo) 

v translated parallel translated so 

that it begins Figure 2.6.2 Geometric significance of 
at (xoro zo) the gradient: Vf is orthogonal to the 


surface S on which f is constant. 


Co Yos Zo) 


PROOF Let c(t) lie in S; then f(c(t)) = k. Let v be as in the hypothesis; then 
v = e'(0). Hence, the fact that f (e(t)) is constant in t, and the chain rule give 


d 
o= 7 e| =Vf(C)-v. m 
1=0 


If we study the conclusion of Theorem 14, we see that it is reasonable to define 
the plane tangent to S as the orthogonal plane to the gradient. 


DEFINITION: Tangent Planes to Level Surfaces Let S be the surface 
consisting of those (x, y, z) such that f(x, y, z) = k, for k a constant. The tan- 
gent plane of S at a point (xo, yo, zo) of S is defined by the equation 


Vf (Xo, Yo, zo) * (X — Xo. y — Yo. Z — zo) = 0 (1) 


if Vf (xo, yo, zo) # 0. That is, the tangent plane is the set of points (x, y, z) that 
satisfy equation (1). 
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This extends the definition we gave earlier for the tangent plane of the graph of 
a function (see Exercise 11 at the end of this section). 


Compute the equation of the plane tangent to the surface defined 
by 3xy + z? = 4 at (1, 1, 1). 


SOLUTION Here f(x, y,z)=3xy+z? and Vf —(3y,3x,2z) which at 
(1, 1, 1) is the vector (3, 3, 2). Thus, the tangent plane is 


(3,3,2).(x—-Lly-Lz—1)20; 
that is, 
3x+3y+2z=8. A 


In Theorem 14 and the definition following it, we could just as well have worked 
in two dimensions as in three. Thus, if we have f: R? — R and consider a level curve 


C= {@,y)| fe») = kh, 


then Vf (xo, yo) is perpendicular to C for any point (xo, yo) on C. Likewise, the tangent 
line to C at (xo, yo) has the equation 


Vf (xo, Yo) (x — xo, y — Yo) = 0 (2) 
if Vf (xo, yo) Æ 9; that is, the tangent line is the set of points (x, y) that satisfy equation 
(2) (see Figure 2.6.3). 
y 
Vf translated 


Figure 2.6.3 In the plane, the gradient Vf is 


Tangent line to C orthogonal to the curve f — constant. 


'The Gradient Vector Field 


We often speak of Vf as a gradient vector field. The word “field” means that Vf 
assigns a vector to each point in the domain of f. In Figure 2.6.4 we describe the 
gradient Vf not by drawing its graph, which, if f: IR? — R, would be a subset of 
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R, that is, the set of tuples (x, Vf (x)), but by representing V/(P), for each point P, 
as a vector emanating from the point P rather than from the origin. Like a graph, this 
pictorial method of depicting V f contains the point P and the value Vf (P) in the same 
picture. 


Figure 2.6.4 The gradient Vf ofa function f: R? > R 
is a vector field on R?; at each point P;, V/(P;) is a vector 
emanating from P;. 


V/GC;) 


The gradient vector field has important geometric significance. It shows the 
direction in which f is increasing the fastest and the direction that is orthogonal to 
the level surfaces (or curves in the plane) of f. That it does both of these at once is 
quite plausible. To see this, imagine a hill as shown in Figure 2.6.5(a). Let h be the 
height function, a function of two variables. If we draw level curves of h, these are 
just level contours of the hill. We could imagine them as level paths on the hill [see 
Figure 2.6.5(b)]. One thing should be obvious to anyone who has gone for a hike: To 
get to the top of the hill the fastest, we should walk perpendicular to level contours.? 
This is consistent with Theorems 13 and 14, which state that the direction of fastest 
increase (the gradient) is orthogonal to the level curves. 


DOWIE SA The gravitational force on a unit mass m at (x, y, z) produced by 
a mass M at the origin in R? is, according to Newton's law of gravitation, given by 


GmM 


ye 


n, 


where G is a constant; r = ||r|| = yx? + y? + z?, which is the distance of (x, y, z) 
from the origin; and n = r/r, the unit vector in the direction of r = xi 4- yj + zk, 
which is the position vector from the origin to (x, y, z). 

Note that F = V(GmM/r) = —VV, that is, F is the negative of the gradient 
of the gravitational potential V = —Gm M/r. This can be verified as in Example 1. 


5This discussion assumes that one walks at the same speed in all directions. Of course, hikers know that this is not 
necessarily realistic. 


170 


Differentiation 


A curve of Contour map of a hill 
steepest ascent 250 feet high 
up the hill 


() (b) 


Figure 2.6.5 A physical illustration of the two facts (a) Vf is the direction of 
fastest increase of f, and (b) Vf is orthogonal to the level curves. 


Notice that F is directed inward toward the origin. Also, the level surfaces of V are 
spheres. The gradient vector field F is normal to these spheres, which confirms the 
result of Theorem 14. 4 


Sa] Find a unit vector normal to the surface S given by z = x?y? + 
y + 1 at the point (0, 0, 1). 

SOLUTION Let f(x, y, z) = x?y? + y + 1 — z, and consider the level surface 
defined by f(x, y, z) = 0. Because this is the set of points (x, y, z) with z = x? y? + 
y + 1, we see that this level set coincides with the surface S. The gradient is given by 


9f. af, Of 
cud ee 


Vf(x, y, z) k = 2xy?i + Qx? y + Dj — k, 


and so 
Vf(0,0,1) 2j — k. 


This vector is perpendicular to S at (0, 0, 1), and so to find a unit normal n we divide 
this vector by its length to obtain 


| Vf@,0,1) 1, 
n= Tvoo Dp 7 V; A 


Consider two conductors, one charged positively and the other 
negatively. Between them, an electric potential is set up. This potential is a function 
à: R? — R (an example of a scalar field). The electric field is given by E = —Vó. 
From Theorem 14 we know that E is perpendicular to level surfaces of ġ. These level 
surfaces are called equipotential surfaces, because the potential is constant on them 
(see Figure 2.6.6). 4 
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Lines of constant à 


Figure 2.6.6 Equipotential surfaces (the dotted lines) 
are orthogonal to the electric force field E. 


EXERCISES 
1. Show that the directional derivative of f(x, y, z) = zx + y? at (1, 1, 2) in the direction 
(1/4/5)i + (2/4/5)j is 24/5. 


2. Compute the directional derivatives of the following functions at the indicated points in 
the given directions: 


(a) fŒ, y) =x + 2xy — 3y^, (xo. yo) = (1,2), v = ži + $j 

(b) fx, y) = log Vx? d y, (xo. yo) = (1,0), v = (1/V5)(2i + j) 

(c) fŒ, y) = æ cosGry), (xo, yo) = (0, —1), v = —(1/5)i + 2/4/5)j 
(d) f(x, y) = xy? +x, (xo. yo) = (4, —2), v = (1/410)i + 3/4/10)j 


3. Compute the directional derivatives of the following functions along unit vectors at the 
indicated points in directions parallel to the given vector: 


(a) fŒ, y) =x”, (xo, yo) = (e, e), d = Si + 12j 


(b) fœ, y, z) = e* + yz, (xo, yo, zo) = (1, 1, 1), d = (1, —1, 1) 
(c) f(x, y, z) = xyz, (Xo. yo, zo) = (1, 0, 1), d = (1,0, —1) 


4. Find the planes tangent to the following surfaces at the indicated points: 
(a) x? + 2y? + 3xz = 10, at the point (1, 2, 1) 


(b) y? — x? = 3, at the point (1, 2, 8) 
(c) xyz = 1, at the point (1, 1, 1) 


5. Find the equation for the plane tangent to each surface z = f(x, y) at the indicated 
point: 
(a) z — x? + y? — 6xy, at the point (1, 2, —3) 
(b) z = (cos x (cos y), at the point (0, 7/2, 0) 
(c) z = (cos x)(sin y), at the point (0, 7/2, 1) 


6. Compute the gradient V/ for each of the following functions: 
(a) SX, yz) = M /x* +y +2? 
(b) f(x. y.z) o xy + yz xz 


© fado seas 
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7. For the functions in Exercise 6, what is the direction of fastest increase at (1, 1, 1)? [The 
solution to part (c) only is in the Study Guide to this text.] 


8. Show that a unit normal to the surface x? y? + y — z + 2 = 0 at (0, 0, 2) is given by 
n= (1/V2\(j — k). 


9. Find a unit normal to the surface cos(xy) = e^ — 2 at (1, 7, 0). 
10. Verify Theorems 13 and 14 for f(x, y, z) =x? +y? +2’. 


11. Show that the definition following Theorem 14 yields, as a special case, the formula for 
the plane tangent to the graph of f(x, y) by regarding the graph as a level surface of 
F(x, y, z) = f(x, y) — z (see Section 2.3). 


12. Let f(x, y) = —(1 — x? — y?)!/? for (x, y) such that x? + y? < 1. Show that the plane 
tangent to the graph of f at (xo, yo, (Xo, yo)) is orthogonal to the vector with components 
(Xo. Yo; J (Xo, yo)). Interpret this geometrically. 


13. For the following functions f: R? — R and g: R > R’, find Vf and g’ and evaluate 
Cf o gy). 

(a) f(x, y,z) 2 xz + yz + xy, g(t) = (e', cost, sint) 

(b) fŒ, y, z) = e7, gir) = (6t, 377, 0) 

(c) fæ y, z) = G7 +y? +2?) log yx? +y? +27, g(t) = (e', e™, t) 


14. Compute the directional derivative of f in the given directions v at the given points P. 


(a) f(x, y, z) = xy? +72 + zix, P = (4, —2, —1), v = 1/A/ Mi + 3j + 2k) 
(b) f(x, yz) 2x", P=(e,e,0), v= i+ 5i sk 


15. Letr = xi + yj 4- zk andr = |r|]. Prove that 


16. Captain Ralph is in trouble near the sunny side of Mercury. The temperature of the ship's 
hull when he is at location (x, y, z) will be given by T(x, y, z) = e" 2 —* , where x, y, 
and z are measured in meters. He is currently at (1, 1, 1). 


(a) In what direction should he proceed in order to decrease the temperature most 
rapidly? 

(b) If the ship travels at e° meters per second, how fast will be the temperature decrease 
if he proceeds in that direction? 

(c) Unfortunately, the metal of the hull will crack if cooled at a rate greater than /14e? 
degrees per second. Describe the set of possible directions in which he may proceed to bring 
the temperature down at no more than that rate. 


17. A function f: IR? — R is said to be independent of the second variable if there is a 
function g: IR — RR such that f(x, y) — g(x) for all x in IR. In this case, calculate Vf in 
terms of g’. 
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18. Let f and g be functions from IR? to IR. Suppose f is differentiable and Vf (x) = g(x)x. 
Show that spheres centered at the origin are contained in the level sets for f; that is, f is 
constant on such spheres. 


19. A function f: IR" — R is called an even function if f(x) = f(—x) for every x in R". If 
f is differentiable and even, find D f at the origin. 


20. Suppose that a mountain has the shape of an elliptic paraboloid z — c — ax? — by?, 
where a, b, and c are positive constants, x and y are the east-west and north-south map 
coordinates, and z is the altitude above sea level (x, y, z are all measured in meters). At the 
point (1, 1), in what direction is the altitude increasing most rapidly? If a marble were 
released at (1, 1), in what direction would it begin to roll? 


21. An engineer wishes to build a railroad up the mountain of Exercise 20. Straight up the 
mountain is much too steep for the power of the engines. At the point (1, 1), in what 
directions may the track be laid so that it will be climbing with a 3% grade—that is, an 
angle whose tangent is 0.03? (There are two possibilities.) Make a sketch of the situation 
indicating the two possible directions for a 3% grade at (1, 1). 


22. In electrostatics, the force P of attraction between two particles of opposite charge is 
given by P = K(r/||r|?) (Coulomb 5 law), where k is a constant and r = xi + yj + zk. Show 
that P is the gradient of f = —K/||rl|. 


23. The electrostatic potential V due to two infinite parallel filaments with linear charge 
densities A and —A is V = (à/27 £0) In (r2/r1), where r? = (x — xo)? + y? andr? = 

(x + xo)? + y^. We think of the filaments as being in the z direction, passing through the xy 
plane at (—xo, 0) and (xo, 0). Find V V (x, y). 


24. For each of the following, find the maximum and minimum values attained by the 
function f along the path c(t): 

(a) f(x, y) = xy;e(t) = (cost, sint);0 < t < 2z. 

(b) f(x, y) =x? + y^; e(t) = (cost, 2sint);0 < t < 27. 


25. Suppose that a particle is ejected from the surface x? + y? — z? = —1 at the point 
(1, 1, V3) along the normal directed toward the xy plane to the surface at time ¢ = 0 with a 
speed of 10 units per second. When and where does it cross the xy plane? 


26. Let f: R? — R and regard D/(x, y, z) as a linear map of IR to IR. Show that the kernel 
(that is, the set of vectors mapped to zero) of Df is the plane in IR? orthogonal to Vf. 


REVIEW EXERCISES FOR CHAPTER 2 
1. Describe the graphs of: 


(a) f(x,y) 23x? +y (b) f(x,y) = xy +3x 


2. Describe some appropriate level surfaces and sections of the graphs of: 


@ f@y2=2+y? +2 
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(b fGyz-x 
(e) fG yz) m xyz 


3. Compute the derivative D/(x) of each of the following functions: 


(a) SE, y) = Gy, e?) 
(b) f(x) = (x, x) 

(0) Sx, y, J= +e +e 
(d) fœ, y, z) 2 (x,y,z) 


4. Suppose f(x, y) = f(y, x) for all (x, y). Prove that 
(8f/0xY(a, b) = (89f/8y(b, a). 


5. Let f(x, y) = (1 — x? — y?)"?. Show that the plane tangent to the graph of f at 
(x0, Yo. J (Xo, yo)) is orthogonal to the vector (xo, yo, (Xo, yo)). Interpret geometrically. 


6. Let F(u, v) and u = h(x, y, z), v = k(x, y, z) be given (differentiable) real-valued 
functions and let f(x, y, z) be defined by f(x, y, z) — F(h(x, y, z), k(x, y, z)). Write a 
formula for the gradient of f in terms of the partial derivatives of F, h, and k. 


7. Find an equation for the tangent plane of the graph of f at the point (xo, yo, (xo. yo)) 
for: 


@ f: RP ^ Ry) x - y £2. Gro» yo) = (1, 1) 


(b f: R? — R, (x, y) e» x? +4y’, (xo, yo) = (2, -1) 

(c) f: R? > R, Qr, y) xy, (xo, yo) = (71, —1) 

(d) f(x, y) = log(x + y) +x cos y + arctan(x + y), (xo, yo) = (1, 0) 

(e) f(x, y) 9 Vx* +y, (xo, yo) = (1, 1) 

(f) fœ, y) = xy, (xo. Yo) = (2, 1) 
8. Compute an equation for the tangent planes of the following surfaces at the indicated 
points. 

(a) x? +y? +2? =3, (1, 1, 1) 

(b) x3 — 2y? +23 (1, 1, 1) 

(c) (cosx)(cos y)e* = 0, (7/2, 1, 0) 

(d) e” =1, (1, 1,0) 


9. Draw some level curves for the following functions: 


(a) f(x, y) = I/xy 
(b) f(x,y) =x? -xy -y 


10. Consider a temperature function T(x, y) = x sin y. Plot a few level curves. Compute VT 
and explain its meaning. 


11. Find the following limits if they exist: 


(ij. tint. 2" (b) limit VIG Xy x e» 


G5.) (0,0) x (ey) (0,0) 
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12. Compute the first partial derivatives and gradients of the following functions: 


(a) f(x, y, z) 2 xe? + ycosx 
(b) f(x, y, z) 9 (x +y +z)! 
(c) f(x. y, z) = x? + yz 


ð 
13. Compute axl exp(1 +x? + y?) 
X 


14. Let y(x) be a differentiable function defined implicitly by F(x, y(x)) — 0. From Exercise 
17(a), Section 2.5, we know that 

dy _ _3F/ðx 

dx | 8F[üy 


Consider the surface z — F(x, y), and suppose F is increasing as a function of x and as a 
function of y; that is, dF /dx > 0 and 3F/ðy > 0. By considering the graph and the plane 

z = 0, show that for z fixed at z = 0, y should decrease as x increases and x should decrease 
as y increases. Does this agree with the minus sign in the formula for dy/dx? 


15. (a) Consider the graph of a function f(x, y) [Figure 2.R.1(a)]. Let (xo, yo) lie on a level 
curve C, so Vf(xo, yo) is perpendicular to this curve. Show that the tangent plane of the 
graph is the plane that (i) contains the line perpendicular to Vf (xo, yo) and lying in the 
horizontal plane z = f(xo, yo), and (ii) has slope || Vf (xo, yo)|| relative to the xy plane. (By 
the slope of a plane P relative to the xy plane we mean the tangent of the angle 0, 0 < 0 < 7, 
between the upward-pointing normal p to P and the unit vector k.) 

(b) Use this method to show that the tangent plane of the graph of f(x, y) — 
(x + cos y)x? at (1, 0, 2) is as sketched in Figure 2.R.1(b). 


Slope of tangent plane = ||V f || z 
Level curve raised to graph 

[CET 

Graph f 


oyo) 


Level curve 
Vf Gro» Yo) 


(a) (b) 


Figure 2.R.1 (a) The relationship between the gradient of a function and the 
tangent plane to the graph [Exercise 15(a)]. A specific instance of the plane in 
(b) for Exercise 15(b). 


176 


Differentiation 


16. Find the plane tangent to the surface z = x? + y? at the point (1, —2, 5). Explain the 
geometric significance, for this surface, of the gradient of f(x, y) = x? + y? (see 
Exercise 15). 


17. In which direction is the directional derivative of f(x, y) = (x? — y*)/(x? + y?) at (1, 1) 
equal to zero? 


18. Find the directional derivative of the given function at the given point and in the direction 
of the given vector. 


(a) f(x, y, z) = e cos( yz), Po = (0, 0, 0), v=(2, 1,—2) 
(b) f(x,y,z) =xy +yz + zx, Po = (1,1,2), v = (10, —1, 2) 


19. Find the tangent plane and normal to the hyperboloid x? + y? — z? = 18 at (3, 5, —4). 


20. Let (x(t), y(t)) be a path in the plane, 0 < t < 1, and let f(x, y) be a C' function of 
two variables. Assume that (dx /dt) fx + (dy/dt) f, < 0. Show that f(x(1), y(1)) < 
fœ(0), y(0)). 


21. A bug finds itself in a toxic environment. The toxicity level is given by T(x, y) = 
2x? — 4y?. The bug is at (—1, 2). In what direction should it move to lower the toxicity the 
fastest? 


22. Find the direction in which the function w = x? + xy increases most rapidly at the 
point (—1, 1). What is the magnitude of Vw at this point? Interpret this magnitude 
geometrically. 


23. Let f be defined on an open set 5 in R”. We say that f is homogeneous of degree p over 
S if f(Ax) = A? f (x) for every real A and for every x in S for which Ax € S. 


(a) If such a function is differentiable at x, show that x - V/(x) = pf (x). This is known 
as Euler’s theorem for homogeneous functions. [HrNT: For fixed x, define g(A) = /(Ax) and 
compute g'(1).] 

(b) Find p and check Euler’s theorem for the function f(x, y, z) =x — 2y — xz, on 
the region where xz > 0. 


24. Ifz = [f(x — y)]/y (where f is differentiable and y # 0), show that the identity 
z + y(0z/dx) + y(0z/dy) = 0 holds. 


25. Givenz = f((x + y)/(x — y)) for f a C! function, show that 


26. Let f have partial derivatives 3f (x)/8x;, where i = 1,2, ..., n, at each point x of an 
open set U in IR". If f has a local maximum or a local minimum at the point xo in U, show 
that 0f(x9)/9x; = 0 for each i. 
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27. Consider the functions defined in R? by the following formulas: 


(i) fo. y) m xy/Q^ y) if Q»)*(00. — /(00)—0 
(i) fG y) =x +y) if Q.»)x (0.0. — /(0,0)—0 


(a) In each case, show that the partial derivatives 3f (x, y)/3x and 3f (x, y)/8y exist for 
every (x, y) in R?, and evaluate these derivatives explicitly in terms of x and y. 

(b) Explain why the functions described in (i) and (ii) are or are not differentiable at 
(0, 0). 


28. Compute the gradient vector V/(x, y) at all points (x, y) in IR? for each of the following 
functions: 


(a) fœ, y)=x y log +y) if (x.y) #0),  f(0,0)=0 
(b) fa y) — xysin[l/G? +y) if (y): (0,0, — f(0,0)—0 


29. Find the directional derivatives of the following functions at the point (1, 1) in the 
direction (i + j)/4/2: 


(a) f(x, y) =x tan! (x/y) 
(b) f(x, y) = cos (yx? + y?) 
(c) f(x, y) = exp (x? — y?) 


30. (a) Let u = i — 2j + 2k and v = 2i + j — 3k. Find: ||u||, u- v, u x v, and a vector in the 
same direction as u, but of unit length. 
(b) Find the rate of change of e” sin(xyz) in the direction u at (0, 1, 1). 


31. Let h(x, y) = 2e- + e?" denote the height on a mountain at position (x, y). In what 
direction from (1, 0) should one begin walking in order to climb the fastest? 


32. Compute an equation for the plane tangent to the graph of 


fe ur 


atx-l,y-2. 


33. (a) Give a careful statement of the general form of the chain rule. 
(b) Let f(x, y) = x? + y and h(u) = (sin 3u, cos 8u). Let g(u) = f(h(u)). Compute 
dg/du at u — 0 both directly and by using the chain rule. 


34. (a) Sketch the level curves of f(x, y) = —x? — 9y? forc = 0, —1, —10. 
(b) On your sketch, draw in Vf at (1, 1). Discuss. 


35. At time f = 0, a particle is ejected from the surface x? + 2y? + 3z? = 6 at the point 
(1, 1, 1) in a direction normal to the surface at a speed of 10 units per second. At what time 
does it cross the sphere x? + y? + z? = 103? 
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36. At what point(s) on the surface in Exercise 35 is the normal vector parallel to the line 
x=y=2? 


37. Compute dz/dx and dz/dy if 


u? v? 


-I 3 ucket?, v=e” 
"m 


z= 


(a) by substitution and direct calculation, and (b) by the chain rule. 
38. Compute the partial derivatives as in Exercise 37 if z = uv, u = x + y, and v = x — y. 


39. What is wrong with the following argument? Suppose that w = f(x, y) and y = x?. By 
the chain rule, 


ðw dwdx  ðwðy _ ðw ðw 
ðx ax ax ðyðx x ðy` 


Hence, 0 = 2x(@w/dy), and so ðw/3y = 0. Choose an explicit example to really see that this 
is incorrect. 


40. A boat is sailing northeast at 20 km/h. Assuming that the temperature drops at a rate of 
0.2°C/km in the northerly direction and 0.3°C/km in the easterly direction, what is the time 
rate of change of temperature as observed on the boat? 


41. Use the chain rule to find a formula for (d/dt) exp [ f (Ðg). 
42. Use the chain rule to find a formula for (d/dt)( f (t)€?). 


43. Verify the chain rule for the function f(x, y, z) = [In (1 + x? + 2z*)]/(1 + y?) and the 
path c(t) = (t, 1 — t°, cos t). 


44. Verify the chain rule for the function f(x, y) — x?/(2 -- cos y) and the path x — e', 
y-e'. 


45. Suppose that u(x, t) satisfies the differential equation u, + uu, = 0 and that x, as a 
function x = f(t) of t, satisfies dx /dt = u(x, t). Prove that u( f(t), t) is constant in t. 


46. The displacement at time / and horizontal position on a line x of a certain violin string is 
given by u = sin (x — 6t) + sin (x + 61). Calculate the velocity of the string at x = 1 when 
t=}, 

3 


47. The ideal gas law PV = nRT involves a constant R, the number n of moles of the gas, 
the volume V, the Kelvin temperature 7, and the pressure P. 


(a) Show that each of n, P, T, V is a function of the remaining variables, and determine 
explicitly the defining equations. 
(b) Calculate 3V /3T, 9T/8 P, 3P/3V and show that their product equals —1. 


Review Exercises 179 


48. The potential temperature 0 is defined in terms of temperature 7 and pressure p by 


1000 7^ 
= T| — 7 
p 


The temperature and pressure may be thought of as functions of position (x, y, z) in the 
atmosphere and also of time f. 


(a) Find formulas for 96/8x, 80/8y, 80/8z, 90/8t in terms of partial derivatives of T 
and p. 

(b) The condition 06/0z < 0 is regarded as an unstable atmosphere, for it leads to large 
vertical excursions of air parcels from a single upward or downward impetus. Meteorologists 
use the formula 


where g — 32.2 and C, is a positive constant. How does the temperature change in the 
upward direction for an unstable atmosphere? 


49, The specific volume V, pressure P, and temperature T of a van der Waals gas are related 
by P = RT/(V — B) — o/ V?, where a, B, and R are constants. 


(a) Explain why any two of V, P, and T can be considered independent variables that 
determine the third variable. 

(b) Find 37/8 P, 9P/8V,9V/8T. Identify which variables are constant, and interpret 
each partial derivative physically. 

(c) Verify that (97/8 P)(aP/AV (AV /AT) = —1( not +1!). 


50. The height ^ of the Hawaiian volcano Mauna Loa is (roughly) described by the function 
h(x, y) = 2.59 — 0.00024y? — 0.00065x?, where h is the height above sea level in miles and 
x and y measure east-west and north-south distances in miles from the top of the mountain. 
At (x, y) = (—2, —4): 


(a) How fast is the height increasing in the direction (1, 1) (that is, northeastward)? 
Express your answer in miles of height per mile of horizontal distance traveled. 
(b) In what direction is the steepest upward path? 


51. (a) In what direction is the directional derivative of f(x, y) = (x? — y?)/(x? + y?) at 
(1, 1) equal to zero? 

(b) How about at an arbitrary point (xo, yo) in the first quadrant? 

(c) Describe the level curves of f. In particular, discuss them in terms of the result 
of part (b). 


52. (a) Show that the curve x? — y? — c, for any value of c, satisfies the differential 
equation dy/dx — x/y. 

(b) Draw in a few of the curves x? — y? = c, say forc = +1. At several points (x, y) 
along each of these curves, draw a short segment of slope x / y; check that these segments 
appear to be tangent to the curve. What happens when y — 0? What happens when c — 0? 
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53. Suppose that f is a differentiable function of one variable and that a function 
u = g(x, y) is defined by 


xy 


u =e. =a 22), 


Show that u satisfies a (partial) differential equation of the form 


ðu ðu 
R E Gtx, 
ax > ay (x, yu 


and find the function G(x, y). 


54. (a) Let F bea function of one variable and f a function of two variables. Show that the 
gradient vector of g(x, y) = F( f(x, y)) is parallel to the gradient vector of f(x, y). 

(b) Let f(x, y) and g(x, y) be functions such that Vf = AVg for some function A(x, y). 
What is the relation between the level curves of f and g? Explain why there might be a 
function F such that g(x, y) = F(f(x, y)). 


3 


Higher-Order Derivatives: 
Maxima and Minima 


Leonhard Euler 
(by Emanuel 
Handman) 
(1707-1783). 


All that is superfluous displeases God and Nature. 
All that displeases God and Nature is evil. 


Dante Alighieri, circa 1300 


... namely, because the shape of the whole universe is most perfect, and, 
in fact, designed by the wisest creator, nothing in all of the world will 
occur in which no maximum or minimum rule is somehow shining forth. 


Leonhard Euler 


n one-variable calculus, to test a function f(x) for a local maximum or minimum, 
one often uses the second derivative. We look for critical points xo, that is, points xo 
for which f’(xo) = 0, and at each such point we check the sign of the second derivative 
f" (xo). If f" (xo) < 0, f(xo) is a local maximum of f; if f"(xo) > 0, f(xo) is a local 
minimum of f; if f"(xo) = 0, the test fails. 

This chapter extends these methods to real-valued functions of several variables. 
We begin in Section 3.1 with a discussion of iterated and higher-order partial deriva- 
tives, and in Section 3.2 we discuss the multivariable form of Taylor's theorem; this 
is then used in Section 3.3 to derive tests for maxima, minima, and saddle points. 
As with functions of one variable, such methods help one to visualize the shape of 
a graph. 

In Section 3.4, we study the problem of maximizing a real-valued function subject 
to supplementary conditions, also referred to as constraints. For example, we might 


181 


182 Higher-Order Derivatives: Maxima and Minima 


wish to maximize f(x, y, z) among those (x, y, z) constrained to lie on the unit sphere, 
x? + y? +2? = 1. Section 3.5 discusses a technical theorem (the implicit function 
theorem) useful for studying constraints. It will also be useful later in our study of 
surfaces. 


3.1 Iterated Partial Derivatives 


The preceding chapter developed considerable information concerning the derivative 
of a map and investigated the geometry associated with the derivative of real-valued 
functions by making use of the gradient. In this section, we proceed to study higher- 
order derivatives, with the goal of proving the equality of the *mixed second partial 
derivatives" of a function. We begin by defining the necessary terms. 

Let f:IR? > R be of class C!. Recall that this means that f/x, 9f//9y, and 
9f /0z exist and are continuous. If these derivatives, in turn, have continuous partial 
derivatives, we say that f is of class C?, or is twice continuously differentiable. 
Likewise, if we say f is of class C^, we mean f has continuous iterated partial 
derivatives of third order, and so on. Here are a few examples of how second-order 
derivatives are written: 


ar a (af Sf a ar) af s (ur Da 
əx? ax dx)’  ðxðy = ax ay)’  əðzəðy  əðz\ðy J’ d 


The process can, of course, be repeated for third-order derivatives, and so on. If f is 
a function of only x and y and 0f/0x, 0f/dy are continuously differentiable, then by 
taking second partial derivatives we get the four functions 


Spo of ef a Pf 
ax?’ ay?’ ax ay’ ay ax" 


All of these are called iterated partial derivatives, while 0?f/àx dy and 0?f/8y ax 
are called mixed partial derivatives. 


Find all second partial derivatives of f(x, y) = xy + (x + 2y}. 
SOLUTION The first partials are 


a a 
BP a pale dd UF es pate A. 
Ox ay 


Now differentiate each of these expressions with respect to x and y: 


a 2 
LE fF 6 
ax? ay? 

a a 
fo HL 


üx8üy  " ðyəðx 
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EXAMPLE 2| Find all second partial derivatives of f(x, y) — sinx sin? y. 
SOLUTION We proceed just as in Example 1: 


a 9 "e TA 
af — cosx sin’ y, of = 2sinx sin y cos y = sinx sin2y; 
Ox oy 
a a 
of — —sinx sin y, qur = 2sinx cos2y; 
a? ^ a : z 
as — cosx sin2y, T i = 2cosxsinycosy = cosxsin2y. A 
DENNIE] Let f(x, y, z) =e +zcosx. Then 
a a a 
af = ye? — zsinx, af zx. af = cosx, 
ox oy Oz 
9? 2 
f — —sinx, of sinx, etc. A 
0z Ox Ox dz 


The Mixed Partials are Equal 


Inall these examples note that the pairs of mixed partial derivatives, such as 3?f/3x dy 
and 3?f/ðy dx, or 0?f/0z Ax and 0?f/üx Jz, are equal. It is a basic and perhaps 
surprising fact that this is always the case for C? functions. We shall prove this in 
the next theorem for functions f(x, y) of two variables, but the proof can be readily 
extended to functions of n variables. 


THEOREM 1: Equality of Mixed Partials If f(x, y) is of class C? (is 
twice continuously differentiable), then the mixed partial derivatives are equal; 
that is, 


LEA 


üxüy  Oyüx. 


PROOF Consider the following expression (see Figure 3.1.1): 
S(Ax, Ay) = f(xo + Ax, yo + Ay) — f (xo + Ax, yo) 
— f (Xo, yo + Ay) + f (xo, yo). 
Holding yo and Ay fixed, define 
B(x) = fŒ, yo + Ay) — fŒ, yo). 
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x 
Co yo+ Ay) (xo + Ax, yo+ Ay) 
cy - + [D 
Figure 3.1.1 The algebra behind the equality 
of mixed partials: writing the difference of 
ai + 22 [^ differences in two ways. 
(Go, Yo) (Gro Ax, yo) 


so that S(Ax, Ay) = g(xo + Ax) — g(xo), which expresses S as a difference of differ- 
ences. By the mean-value theorem for functions of one variable, g(xo + Ax) — g(xo) 
equals g'(X)Ax for some ¥ between xo and xo + Ax. Hence, 


9, a 
S(Ax, Ay) = ELE + Ay)— Le yn) as 


Applying the mean-value theorem again, there is a y between yo and yo + Ay such 
that 


2 
S(Ax, AY = 2i A y)Ax Ay. 


Because 3?f/3y àx is continuous, it follows that 


9? 


Í 
limi S(Ax, A 
ay dy ax - (Ax, Ay 0.0 AxAy Ju ors All. 


Noting that S is symmetric in Ax and Ay, one shows in a similar way that 0?f/0x dy 
is given by the same limit formula, which proves the result. m 


The equality of mixed partial derivatives is one of the most important results 
of multivariable calculus. It will reappear on several occasions later in the 
book, when we study vector identities. 

In the next historical note, we will discuss the role of partial derivatives 


in the formulation of many of the basic equations governing physical 
phenomena. One of the giants in this era was Leonhard Euler (1707-1783), 
who developed the equations of fluid mechanics that bear his name—the 
Euler equations. It was in connection with the needs of this development that 
he discovered, around 1734, the equality of mixed partial derivatives. Euler 
was about 27 years old at the time. 
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In Exercise 11 we ask the reader to deduce from Theorem 1 that for a C? function 
of x, y, and z, 


af af vf 


= = " tc. 
8x8yüz dzdydx  8yüzüx' ~~ 


In other words, we can compute iterated partial derivatives in any order we please. 


|. IURE Verify the equality of the mixed second partial derivatives for the 
function 


f(x, y) 2 xe" + yx?. 
SOLUTION Here 
af af 


=e +y, = xe? +x’, 
Ox 4 ðy 
9? 9? 
I erdt. Lc mm 42x, 
dy Ox Ox dy 
and so we have 
rf = rf 
dydx dx dy" 


Sometimes the notation fy, fy, f; is used for the partial derivatives: f. = df/dx, 
and so on. With this notation, we write fyy = (f;),, and so equality of the mixed 
partials is denoted by f;, = fyx. Notice that f;,, = Ə? f/8y Ax, so the order of x and 
y is reversed in the two notations; fortunately, the equality of mixed partials makes 
this potential ambiguity irrelevant. The following example illustrates this subscript 
notation. 


z= f(x, y) =e" sinxy 
and write x = g(s, t), y = h(s, t) for certain functions g and A. Let 
k(s, t) = f(g(s, t), h(s, 1). 


Calculate ksr. 
SOLUTION By the chain rule, 


ks = figs + fyhs = (e sinxy + ye* cosxy)g; + (xe* cosxy)hs. 
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Differentiating in ¢ using the product rules gives 
kst = (fe): &s + F8) + (Ay), As + f), 
Applying the chain rule again to ( fx), and (fy), gives 
(f, = SexBt + fxyhi and (5), = fyx& + yl 
and so k,, becomes 


ks = (fx + Áxyhu)gs + fx8si + Cfyx gi "t Jyyhhs * Em 
= fuxgigs + fry(higs + hsgi) + fyyhihs + fx8si fyhsi 


Notice that this last formula is symmetric in (s, t), verifying the equality kst = kts- 
Computing fxx, fry, and fy, we get 


ky = (e sinxy + 2ye* cosxy — y e” sinxy)gig, 
+(xe* cos xy + e cosxy — xye* sinxy)(fh,g; + hsgr) 
— Qe sinxy)h,h, + (& sinxy + ye cos xy)gs + (xe* cos xy)hsr, 


in which it is understood that x = g(s, t) and y = A(s, f). A 


— Si tinal apte 


Some Partial Differential Equations 


Philosophy [nature] is written in that great book which ever is 
before our eyes—I mean the universe—but we cannot understand it 
if we do not first learn the language and grasp the symbols in which 
itis written. The book is written in mathematical language, and the 
symbols are triangles, circles and other geometrical figures, without 
whose help it is impossible to comprehend a single word of it; 
without which one wanders in vain through a dark labyrinth. 


GALILEO 


This quotation illustrates the Greek belief, again popular in the time of 
Galileo, that much of nature could be described using mathematics. In the 
latter part of the seventeenth century this thinking was dramatically 
reinforced when Newton used his law of gravitation to derive Kepler'sthree 
laws of celestial motion (see Section 4.1) to explain the tides, and to show that 
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the earth was flattened at the poles. The impact of this philosophy on 
mathematics was substantial, and many mathematicians sought to 
"mathematize"nature. The extent to which mathematics pervades the 
physical sciences today (and, to an increasing amount, economics and the 
social and life sciences) is testament to the success of these endeavors. 
Correspondingly, the attempts to mathematize nature have often led to new 
mathematical discoveries. 

Many of the laws of nature were described in terms of either ordinary 
differential equations (ODEs, equations involving the derivatives of 
functions of one variable alone, such as the laws of planetary motion) or 
partial differential equations (PDEs), that is, equations involving partial 
derivatives of functions. To give the reader some historical perspective and 
offer motivation for studying partial derivatives, we present a brief 
description of three of the most famous partial differential equations: the 
heat equation, the potential equation (or Laplace'sequation), and the wave 
equation. (Further information on some PDEs is given in Section 8.5.) 


'THE HEAT EQUATION. In the early part of the nineteenth century the 
French mathematician Joseph Fourier (1768-1830) took up the study of heat. 
Heat flow had obvious applications to both industrial and scientific 
problems: A better understanding of it would, for example, make possible 
more efficient smelting of metals and would enable scientists to determine 
the temperature of a body given the temperature at its boundary, and to 
approximate the temperature of the earth's interior. 

Let a homogeneous body B c R° (Figure 3.1.2) be represented by some 
region in 3-space. Let T(x, y, z, t) denote the temperature of the body at the 


x 


Figure 3.1.2 A homogeneous body in space. 
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point (x, y, z) at time t. Fourier proved, on the basis of physical principles 
(described in Section 8.5), that T must satisfy the partial differential equation 
called the heat equation, 
Se p. Be Se (1) 
ax y | 02] at’ 


where k is a constant whose value depends on the conductivity of the material 
comprising the body. 

Fourier used this equation to solve problems in heat conduction. In fact, 
his investigations into the solutions of equation (1) led him to the carey 
of Fourier series. 


THE POTENTIAL EQUATION. Consider the gravitational potential V (often 
called Newton's potential) of a mass m at a point (x, y, z) caused by a point 
mass M situated at the origin. This potential is given by V 2 —Gm M/r, 
wherer = yx? + y? + z?. The potential V satisfies the equation 


2V sv av 
EUM TE que Ry 
axe ay * ag (2) 


everywhere except at the origin, as we will check in the next chapter (see also 
Exercise 23). This equation is known as Laplace’s equation. Pierre-Simon de 
Laplace (1749-1827) had worked on the gravitational attraction of nonpoint 
masses and was the first to consider equation (2) with regard to gravitational 
attraction. He gave arguments (later shown to be incorrect) that equation (2) 
held for any body and any point whether inside or outside that body. 
However, Laplace was not the first person to write down equation (2). The 
potential equation appeared for the first time in one of Euler’smajor papers 
in 1752, “Principles of the Motions of Fluids,” in which he derived the 
potential equation with regard to the motion of (incompressible) fluids. Euler 
remarked that he had no idea how to solve equation (2). Poisson later showed 
that if (x, y, z) lies inside an attracting body, then V satisfies the equation 


Vv SV av 
Jx? ar ay? + mS 4mp, (3) 
where p is the mass density of the attracting body. Equation (3) is now called 
Poisson's equation. Poisson was also the first to point out the importance 

of this equation for problems involving electric fields. Notice that if the 
temperature T is constant in time, then the heat cquation (1) reduces to 
Laplace's equation (2). 
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Laplace'sand Poisson'sequations are fundamental to many fields 
besides fluid mechanics, gravitational fields, and electrostatic fields. For 
example, they are useful for studying soap films and liquid crystals (see The 
Parsimonious Universe: Shape and Form in the Natural World by S. 
Hildebrandt and A. Tromba, Springer-Verlag, New York/Berlin, 1995). 


THE WAVE EQUATION. The linear wave equation in space has the form 


Ox? 05? az? at? 


(4) 


The one-dimensional wave equation 


92r Gah f 
9x2 ^ ag @) 
was derived in about 1727 by Johann II Bernoulli and several years later by 
Jean Le Rond d'Alembertin the study of how to determine the motion of a 
vibrating string (such as a violin string). Equation (4) became useful in the 
study of both vibrating bodies and elasticity. As we shall see when we 
consider Maxwell's equations for electromagnetism in Section 8.5, this 
equation also arises in the study of the propagation of electromagnetic 
radiation and sound waves. 


DEW IGE The partial differential equation u; + xx + uu, = 0, called the 
Korteweg-de Vries equation (or KdV equation, for short), describes the motion of 
water waves in a shallow channel. 


(a) Show that for any positive constant c, the function 
u(x, t) = 3c sech?[}(x —ct)/e| 
is a solution of the Korteweg-de Vries equation. (This solution represents a 
traveling “hump” of water in the channel and is called a soliton.)! 


(b) How do the shape and speed of the soliton depend on c? 


SOLUTION (a) We compute u+, Ux, Uxx, and u,«, using the chain rule and the 
differentiation formula (d/dx) sech x — — sech x tanh x from one-variable calculus. 


!Solitons were first observed by J. Scott Russell around 1840 in barge canals near Edinburgh. He reported his results in 
Trans, R. Soc. Edinburgh 14 (1840): 47-109. 
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Letting a = (x — ct)4/c/2, 


a a 
u, = 6c sech ae sech a = —6c sech? a tanha == 


= 3c? sech? a tanha = c??u tanha. 


Also, 


ð 
Uy = —6c sech? a tanha—— 
ox 


= —3c3/* sech? æ tanha = —/cu tanh o, 


and so u; + cu, = 0 and 


Ve 


2 
Uxx = —Je [s tanh o + u(sech? a = —/e(tanha)u, — = 


2 
= c(tanb? w)u t = c(1 — sech? a)u 


wou? u? 
caus 6 cus 
Thus, 
Uxxx = CUy — Uly; that is, Uxxx + UUy = Cy. 
Hence, 


Ut + Uxxx + uuy = Ut + cu, = 0. 


(b) The speed of the soliton is c, because u(x + ct, t) = u(x, 0). The soliton is higher 
and thinner when c is larger. Its shape at time ¢ = 10 is shown in Figure 3.1.3. A 
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Figure 3.1.3 The graph of u(x, t) = 3 sech?(./e(x — ct)/2) forc = i at times 
(a) t = 0 and (b) t = 10. 
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In Exercises 1 to 6, compute the second partial derivatives Ə f [0x?, °f/dx ay, 9^f/8y Ax, 


9? f/8y* for each of the following functions. Verify Theorem 1 in each case. 
l. f(x, y) = 2xy/(x? + y’)’, on the region where (x, y) 4 (0, 0) 


2. f(x, y, z) =e? + (1/x) + xe, on the region where x # 0 


3. f(x. y) = cos(xy?) 
4. f(x, y) 2 e + yix4 
5. f(x, y) = l/(cos? x + e) 


6. f(x, y)= log(x — y) 


7. Find 9?z/8x?, 8°z/dx dy, 0^z/8y ax, and 9?z/8y? for 


(a) z=3x? +2y? 


(b) z = (2x? + 7x?y)/3xy, on the region where x Æ 0 and y #0 


8. Find all the second partial derivatives of 


(a) z = sin(x? — 3xy) 
(b) z = x? ye 


9. Find fry, f;., fex, and Joys for 


fyz) =x y xy! t yz. 


10. Letz = x*y? — x8 + y^. 


(a) Compute 9?z/8y dx Ax, 0?z/àx Ay Ax, and 0?z/àx Ax dy (also denoted 0?z/8x?8y). 
(b) Compute 0?z/àx dy dy, 0?z/8y Ax ay, and 3?z/ðy ay Ax (also denoted 9?z/3y?0x). 


11. Use Theorem 1 to show that if f(x, y, z) is of class C?, then 


as 


üxüyüz dydzax 


12. Verify that 


ar 


Ox dy az > 


for f(x, y, Z) = ze" + yzix?. 


13. Verify that f. = fewx for f(x, y, z, w) = e™ sin (xw). 


as 


9M 
Oz dy Ax 
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14. If f(x, y, z, w) is of class C?, show that fi; = fewx- 
15. Evaluate all first and second partial derivatives of the following functions: 


(a) f(x, y) = x arctan (x/y) 
(b) f(x, y) = cos yx? + y? 


(c) fœ, y) = exp(—x? — y’) 


16. Let w = f(x, y) be a function of two variables and let x = u + v, y = u — v. Show 
that 


w w dw 


ðuðv əx? ay? 


17. Let f: R? — R be a C? function and let ¢(t) be a C? curve in R?. Write a formula for 
the second derivative (d?/dt?)((f o c)(t)) using the chain rule twice. 


18. Let f(x, y, z) = e" tan (yz) and let x = g(s, t), y = h(s, t), z = k(s, t), and define the 
function m(s, t) = f(g(s, t), h(s, t), k(s, t)). Find a formula for ms, using the chain rule and 
verify that your answer is symmetric in s and f. 

19. A function u = f(x, y) with continuous second partial derivatives satisfying Laplace's 


equation 


u au 


atas 


is called a harmonic function. Show that the function u(x, y) = x? — 3x)? is harmonic. 


20. Which of the following functions are harmonic? (See Exercise 19.) 


(a) f(x,y) =x? -y (d) f(x, y) =y? +3x?y 
(b) f(x, y) =x? + y? (e) f(x, y) = sinx cosh y 
(c) f(x, y) 2 xy (f) f(x, y) 2 e' siny 


21. Let f and g be C? functions of one variable. Set ¢ = f(x — t) + g(x + t). 


(a) Prove that ¢ satisfies the wave equation: 09/81? = 0?$/0x?. 
(b) Sketch the graph of $ against t and x if f(x) = x? and g(x) = 0. 


22. (a) Show that function g(x, t) = 2 + e™ sin x satisfies the heat equation: g; = g;,. 
[Here g(x, t) represents the temperature in a metal rod at position x and time /.] 

(b) Sketch the graph of g for t > 0. (HNT: Look at sections by the planes t = 0, t = 1, 
and t = 2.) 

(c) What happens to g(x, t) as t — oc? Interpret this limit in terms of the behavior of 
heat in the rod. 
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23. Show that Newton's potential V = —Gm M/r satisfies Laplace’s equation 


Sy Sv v. 
ac ay? ao 


for (x, y, z) (0, 0, 0). 


24. Let 


[oe -»»/6 3». — Ga) (0,0) 
fe» | (x. y) = (0,0) 


(see Figure 3.1.4). 


(a) If (x, y) # (0, 0), calculate 8//8x and af/dy. 

(b) Show that (2//3x)(0, 0) = 0 = (af/dy)(0, 0). 

(c) Show that (82f/8x 3y)(0, 0) = 1, (82//8y ax)(0, 0) = —1. 
(d) What went wrong? Why are the mixed partials not equal? 


Figure 3.1.4 The graph of the function 
in Exercise 24. 


3.2 Taylor's Theorem 


When we introduced the derivative in Chapter 2, we saw that the linear approximation 
of a function played an essential role for a geometric reason—finding the equation 
of a tangent plane—as well as an analytic reason—finding approximate values of 
functions. Taylor's theorem deals with the important issue of finding quadratic and 
higher-order approximations. 

Taylor's theorem is a central tool for finding accurate numerical approxi- 
mations of functions, and as such plays an important role in many areas of 
applied and computational mathematics. We shall use it in the next section to de- 
velop the second derivative test for maxima and minima of functions of several 
variables. 

The strategy used to prove Taylor's theorem is to reduce it to the one-variable 
case by probing a function of many variables along lines of the form I(t) = xo + th 
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emanating from a point xo and heading in the direction h. Thus, it will be useful for 
us to begin by reviewing Taylor's theorem from one-variable calculus. 


Single-Variable Taylor Theorem 


When recalling a theorem from an earlier course, it is helpful to ask these basic 
questions: What is the main point of the theorem? What are the key ideas in the 
proof? Can I understand the result better the second time around? 

The main point of the single-variable Taylor theorem is to find approximations 
of a function near a given point that are accurate to a higher order than the linear 
approximation. The key idea in the proof is to use the fundamental theorem of calculus, 
followed by integration by parts. In fact, just by recalling these basic ideas, one can 
reconstruct the entire proof. Thinking in this way will help organize all the pieces that 
need to come together to develop a mastery of Taylor approximations of functions of 
one and several variables. 

For a smooth function f: R — R of one variable, Taylor's theorem asserts 
that: 


fero) = fo) + G9) h + LOM? 4. aL Le) POD aK R4), () 


where 


xo+h (Xo +h- T} 


Ryo, h) = f staat 


is the remainder. For small A, this remainder is small to order k in the sense that 


fim Ro, h) 
im 


h0 EE X ni Q) 


In other words, Rg(xo, A) is small compared to the already small quantity hk, 

The preceding is the formal statement of Taylor’s theorem. What about the proof? 
As promised, we begin with the fundamental theorem of calculus, written in the 
form: 


xo+h 
fle +h) = flo) + i f'G) d. 
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Next, we write dt = —d(xo +h — t) and integrate parts? to give: 


oh 
Fo +h) = f Gxo) + f(xo)h +f S'o +h — 1) dr, 


Xo 
which is the first-order Taylor formula. Integrating by parts again: 


xol 
f" (1)(xo +h — x) dt 


Xo 


1 Xo 
ang f" (0) d(xo +h — zy 
Xo 


1 ] quus 
= zatz POr- oar. 
2 2 Jx 


which, when substituted into the preceding formula, gives the second-order Taylor 
formula: 


1 


xo+h 
7 f" (1)(xo +h — zy. dt. 


xo +h) = f (xo) + f(xo)h + sf"? + 
xo 
This is Taylor’s theorem for k = 2. 

Taylor’s theorem for general k proceeds by repeated integration by parts. The 
statement (2) that Ry(xo, A)/ h* — 0 ash — 0 is seen as follows. For t in the interval 
[xo, xo + A], we have |xo + A — t| < |h| and f**' (x), being continuous, is bounded; 
say, | f**"(z)| < M. Then: 


|Re(xo, A)| = fir) dt} < M 


xo+h (Xo + h—- Tř 
B k! 


nieht 
k! 


0 


and, in particular, | Ry(xo, A)/h*| < |h| M/k! — Oash > 0. 


?Recall that integration by parts (the product rule for the derivative read backwards) reads as: 


b b 
[ mt - [ vdu. 
a " 


Here we choose u = f(r) and v = xo +h — t. 
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Taylor's Theorem for Many Variables 


Ournext goal in this section is to prove an analogous theorem that is valid for functions 
of several variables. We already know a first-order version, that is, when k = 1. Indeed, 
if f: R” — R is differentiable at xy and we define 

Ri (Xo, h) = f (xo + h) — f (xo) — [Df (xo) ](h), 
so that 

f (xo +h) = f (xo) + [Df (xo)](h) + Ri(xo, h), 


then by the definition of differentiability, 


| Ri(xo, h)| 


—0 as h> 0, 
Ihi 


that is, R;(Xo, h) vanishes to first order at xo. In summary, we have: 


THEOREM 2: First-Order Taylor Formula Let f: U c R” > R be 
differentiable at x; € U. Then 


- a 
f (xo +h) = f(xo) + Yn Lao) + Ri(xo, h), 
izi 9X 


where Rı(xo, h)/||h|| —> 0 as h > 0 in R”. 


The second-order version is as follows: 


THEOREM 3: Second-Order Taylor Formula Let f: U c R" > 


R have continuous partial derivatives of third order? Then we may write 


f(x, +h) = fos) Y^ ha 00) +5 PLUS nus (Xo) + Ro(xo, B), 


where R2(Xo, h)/||h]? — 0 as h — 0 and the second sum is over all i’s and j’s 
between 1 and n (so there are n? terms). 


?For the statement of the theorem as given here, f actually needs only to be of class C?, but for a convenient form of the 
remainder we assume f is of class C?. 


3.2 Taylor's Theorem 


Notice that this result can be written in matrix 


form as 


hi 
a a 
Povo +m) = fae) (22 4 m : 
Xn 
hn 
af af af 
Ox; Ox; Ox, Ax2 OX) 0x, 
i f af af 
* 30 hu) Ox2 Ox,  0x20x2 0x2 0x, 
f af 9?f 
Xn OXI  0x,0x2 OX, 0X, 


+ Ro(Xo, h), 


where the derivatives of f are evaluated at xo. 


In the course of the proof of the Theorem 3, we shall obtain a useful explicit 


formula for the remainder, as in the single-variable theorem. 


PROOF OF THEOREM 3 Let g(t) = f (xo + th) with xo and h fixed, which is 
a C? function of t. Now apply the single-variable Taylor theorem (1) to g, with k = 2 


to obtain 
&(1) = g(0) + g'(0) + 77. + Ro, 
where 
R= [S u= 
By the chain rule, 


“a 
g()-' aL (o Th; — g')- 


i=l 


and 


m - f 
th= fae EM 
£e 2 1 OX; OX; Xk 


£D 


aT pat. 


n 


P» 


ij-l 


(Xo + th)h;h jh. 


zm + th)hihj, 


(3) 
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Writing R2 = R»(xo, h) we have thus proved: 


f(x +h) = foo nse Lati PUS ee (x0) + Ro(o,h), 
where (4) 


t-1P a 
Ro(Xo, h) = x[t ( ) FE m (xo + th)hjh jh, dt. 
A SI 1 


The integrand is a continuous function of ¢ and is therefore bounded by a positive 
constant C on a small neighborhood of xo (because it has to be close to its value at 
xo). Also note that |^;| < ||h||, for ||h|| small, and so 


|Ro(Xo, h)| < [|hIPC. 
In particular, 


| Rz(xo, h)] 

— — — < |h|C — 0 as h — 0, 

I|]? 

as required by the theorem. 

The proof of Theorem 2 follows analogously from the Taylor formula (1) with 

k = 1. A similar argument for R, shows that | R; (xo, h)|/||h|| > Oash — 0, although 
this also follows directly from the definition of differentiability. m 


Forms of the Remainder In Theorem 2, 


Rio, h) = Tl a-o 


ij-l 


vf 
;üx; 


(i; Jhi hj, 
(5) 


L- (xo ub) dt = pe 


where c;; lies somewhere on the line joining xo to xo 4- h. 
In Theorem 3, 


n 1 2 ay 
C=) af 
Río h) = > . f Eo gx, (o BIA he dt 
ij 


“1 ay 
= ADEL Cy ih hk, 5’ 
P» 3! Ox; 8xj Ox,“ ae e) 


where c;;; lies somewhere on the line joining xo to xo + h. 


3.2 Taylor's Theorem 199 


The formulas involving ¢;; and c;;; (called Lagrange’s form of the remainder) are 
obtained by making use of the second mean-value theorem for integrals. This states 
that 


b b 
f h(tyg(r) dt = hc) f g(r)dt, 


provided h and g are continuous and g > 0 on [a, b]; here c is some number between 
a and b.* This is applied in formula (5) for the explicit form of the remainder with 
h(t) = (82f/8x;x;)(xo + th) and g(t) = 1 — t. 

The third-order Taylor formula is 


8 n. Of Le af 
fGo € b) = Sat) ir (0) 5 P» hax O 


på P h;h;h m m h) 
; ij k Jx; Ox; Ox 0 3(Xo, h), 


where Rs(xo, h)/||h||> — 0 as h — 0, and so on. The general formula can be proved 
by induction, using the method of proof already given. 


589991] Compute the second-order Taylor formula for the function 
f(x, y) 5 sin (x + 2y), about the point xo = (0, 0). 


SOLUTION Notice that 


(0,0) — 0, 
8f (0,0) = cos(0-- 2-0) = 1, 9f (0,0) = 2cos (01-20) =2, 
Ox ay 

a BOR ca f = 
9,3 (0,0) — 0, 3 (0, 0) — 0, ix ay” 0) — 0. 


Thus, 
f(h) = fni, hz) = hy + 2h2 + R2(0, h), 
*Proof If g = 0, the result is clear, so we can suppose g # 0; thus, we can assume js b g(t)dt > 0. Let M and m be the 
maximum and minimum values of A, achieved at /,; and tm, respectively. Because g(t) > 0, 
b b b 
m f g(t)dt < f h(t)g(t)dt < mf g(t)dt. 
a a a 


Thus, ( r4 h(Og(o)dr) / ( T»: goat) lies between m = h(t) and M = h(tm) and therefore, by the intermediate-value 
theorem, equals A(c) for some intermediate c. B 
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where 


R2(0, h) 


TTE as h—0. 4 


| 9198997] Compute the second-order Taylor formula for f(x, y) = e* cos y 
about the point xo = 0, yo = 0. 


SOLUTION Here 


af af 
0,0) =1, 0,0) 1, 0, 0) = 0, 
f(0,0) a, 0.0 5, 0.0 =0 
af a a^ 
avd = 1 = , — Vy 
axi (0,0) — 1, a? (0, 0) I 3x ay? 0)=0 
and so 
Sh) = f(y, ho) = 1 +h + Eh? — 143 + RO, h), 
where 


R2(0, h) 
Ih]? 


—0 as h—0. a 


In the case of functions of one variable, one can expand f(x) in an infinite power 
series, called the Taylor series: 


7 2 k), k 
Fothen roomt LOOM 5... , Ee 


k! 


provided one can show that Rg(xo, h) — 0 as k — oo. Similarly, for functions of 
several variables the preceding terms are replaced by the corresponding ones involving 
partial derivatives, as we have seen in Theorem 3. Again, one can represent such a 
function by its Taylor series provided one can show that A — 0 as k — oo. This 
point is examined further in Exercise 7. 


DAAT Find the first- and second-order Taylor approximations to 
f(x, y) 5 sin(xy) at the point (xo, yo) = (1, 7/2). 


SOLUTION Here 
f (xo, yo) = sin (xoyo) = sin (7/2) = 1 
fs Yo) = yo cos (ojo) = Z cos(r/2) = 0 


Jy (Xo, yo) = xo cos (xoyo) = cos (7/2) = 0 
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2 
E Te . 
fx (Xo, Yo) = -y sin (xoyo) = = sin (7/2) = —z?/4 


à LAM 
xy (xo, yo) = cos (xoyo) — xoyo sin (xoyo) = =o 8n (1/2) = —/2 
yy (Xo, Yo) = —xà sin (xoyo) = —sin (7/2) = —1. 
Thus, the linear (first-order) approximation is 


U(x, y) = fo, yo) + fs Gro. Yo) — xo) + fy(xo. YoY — yo) 
=140+0=1, 


and the second-order (or quadratic) approximation is 


1 2 
gi. y) 1+0+0+3( «Je 1 «( ze »(» z) 


See Figure 3.2.1. A 


Figure 3.2.1 The linear and quadratic approximations to z = sin (xy) near (1, 7/2). 


DTW IHR.) Find linear and quadratic approximations to the expression 
(3.98 — 1?/(5.97 — 3. Compare with the exact value. 
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SOLUTION Let f(x, y) = (x — 1?/(y — 3". The desired expression is close to 
f(4, 6) = 1. To find the approximations, we differentiate: 


2(x — 1) —2(x — 1}? 
k= A h= 
(»-3) (y —3) 
—4(x — 1) 2 _ 6 — y 
ele Gap? Gap jin 
At the point of approximation, we have 
2 2 4 2 i 
f6-3 f ——h —fy—f&——-y fec f»—i 
The linear approximation is then 
2 2 
1+ 3 (0.02) _ 30.03) = 1.00666. 
The quadratic approximation is 
2(-0.02 4 2 (—0.03) 


2 2 
1 + $70.02) — 2(-0.03) + g (70.02 -0.03) 4 


9 2 3 2 


= 1.00674. 


The “exact” value using a calculator is 1.00675. A 


EXERCISES 


In each of Exercises 1 to 6, determine the second-order Taylor formula for the given function 
about the given point (xo, yo). 


1. f(x, y) = (x + y}, where xo = 0, yo = 0 

Ao f(xy) = VG +3? + 1), where xo = 0, yo —0 
3. f(x, y) =e", where xo = 0, yo = 0 

4. f(x,y) =e" cos (xy), where xy = 0, yo = 0 

S. f(x,y) = sin (y) + cos (xy), where xo = 0, yo = 0 
6. f(x,y) = e*-" cos y, where xy = 1, yo = 0 


7. (Challenging) A function f: IR — R is called an analytic function provided 


(k) 
fG 4h) = f(x) + fh 4-4 Ley pei 


[i.e., the series on the right-hand side converges and equals f(x + A)]. 
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(a) Suppose / satisfies the following condition: On any closed interval [a, 5], there is a 
constant M such that for all k = 1, 2,3,..., |f 9 (x)| < M* for all x € [a, b]. Prove that f is 
analytic. 


(b) Let f(x) = li 


Show that f is a C® function, but f is not analytic. 

(c) Give a definition of analytic functions from R” to R. Generalize the proof of part (a) 
to this class of functions. 

(d) Develop f(x, y) = e**” in a power series about xo = 0, yo = 0. 


3.3 Extrema of Real-Valued Functions 


i arctan ul ar 


As we saw in the book's Historical introduction, the early Greeks sought to 
mathematize nature and to find, as in the geometric Ptolemaic model of 
planetary motion, mathematical laws governing the universe. With the 
revival of Greek learning during the Renaissance, this point of view again 


took hold and the search for these laws recommenced. In particular, the 
question was raised as to whether there was one law, one mathematical 
principle that governed and superseded all others, a principle that the 
Creator used in His Grand Design of the Universe. 


MAUPERTUIS's PRINCIPLE. In 1744, the French scientist Pierre-Louis de 
Maupertuis (see Figure 3.3.1) put forth his grand scheme of the world. The 
“metaphysical principle"of Maupertuis is the assumption that nature always 
operates with the greatest possible economy. In short, physical laws are a 
consequence of a principle of “economy of means"; nature always acts in 
such a way as to minimize some quantity that Maupertuis called the action. 
Action was nothing more than the expenditure of energy over time, or 
energy x time. In applications, the type of energy changes with each case. 
For example, physical systems often try to “rearrange themselves" to have a 
minimum energy—such as a ball rolling from a mountain peak to a valley, 
or the primordial irregular Earth assuming a more nearly spherical shape. 
As another example, the spherical shape of soap bubbles is connected 
with the fact that spheres are the surfaces of least area containing a fixed 
volume. 

We state Maupertuis's principle formally as: Nature always minimizes 
action. Maupertuis saw in this principle an expression of the wisdom of the 
Supreme Being, of God, according to which everything in nature is 
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Pierre-Louis de Maupertuis (1698-1759). 


performed in the most economical way. He wrote: 


What satisfaction for the human spirit that, in contemplating these 
laws which contain the principle of motion and of rest for all bodies 
in the universe, he finds the proof of existence of Him who governs 
the world. 


Maupertuis indeed believed that he had discovered God'sfundamental law, 
the very secret of Creation itself, but he was actually not the first person to 
pose this principle. 

In 1707, Leibniz wrote down the principle of least action in a letter to 
Johann Bernoulli, which became lost until 1913, when it was discovered 
in Germany's Gotha library. For Leibniz, this principle was a natural 
outgrowth of his great philosophical treatise The Theodicy, in which he 
argues that God may indeed think of all possible worlds, but would want to 
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create only the best among them; and hence our world is necessarily the best 
of all possible worlds. 

Action, as defined by Leibniz, was motivated by the following 
reasoning, used in his letter. Think of a hiker walking along a road, and 
consider how to describe his action. If he travels 2 kilometers in 1 hour, you 
would say that he has carried out twice as much action as he would if he 
traveled 2 kilometers in 2 hours. However, you would also say that he carries 
out twice as much action in traveling 2 kilometers in 2 hours as he would in 
traveling 1 kilometer in 1 hour. Altogether then, our hiker, by walking 2 
kilometers in 1 hour, carries out 4 times as much action as he would in 
traveling 1 kilometer in 1 hour. 

Using this intuitive idea, Maupertuis defined action as the product of 
distance, velocity, and mass: 


Action — Mass x Distance x Velocity. 


Mass is included in this definition to account for the hiker'sbackpack. 
Moreover, according to Leibniz, the kinetic energy E is given by the formula: 


E- ; x Mass x (Velocity). 


So action has the same physical dimension as 
Energy x Time, 


because velocity is distance divided by time. 


PRINCIPLE OF LEAST ACTION. In the 250 years after Maupertuis 
formulated his principle, this principle of least action has been found to be a 
“theoretical basis" for Newton’s law of gravity, Maxwell's equations for 
electromagnetism, Schrédinger’s equation of quantum mechanics, and 
Einstein’s field equation in general relativity. 

Max Planck (see Figure 3.3.2), one of the greatest scientists of the modern 
era and the discoverer of the “quantization” of nature, was also a profound 
believer in the mathematical design of the universe. On June 29, 1922, on 
“Leibniz Day” in Berlin, Germany, just a few years after World War I, with all 
its terrible carnage, Planck delivered an address honoring this great scholar. 

What follows are some excerpts from Planck’s remarks: 


The Theodicy culminates with the statement that whatever occurs in 
our world, in the large as in the small, in nature as in spiritual life, is 
once and for all regulated by divine reason, and in such a way that 
our world is the best among possible worlds. 

Would Leibniz reaffirm this statement even today, in view of 
the misery of the present time, in view of the bitter failure of many 
efforts not immediately aimed at material gain, in view of the 
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Figure 3.3.2 Max Planck 
(1858-1947). 


undeniable fact that the imagined general harmony of people today 
seems to be further away from its realization than ever? No doubt, we 
should have to answer this question in the affirmative, even if we did 
not know that Leibniz never ceased to earnestly occupy himself until 
his last years despite an adverse fate and disappointments of all kinds, 
and we shall hardly err in assuming that it was exactly the Theodicy 
that gave him support and comfort in the most sorrowful days of his 
life. This once again is a touching example of the old truth that our 
most profound and most sacred principles are firmly rooted in our 
innermost soul, independent of experiences in the outer world. 
Modern science, in particular under the influence of the 
development of the notion of causality, has moved far away from 
Leibniz’s teleological point of view. Science has abandoned the 
assumption of a special, anticipating reason, and it considers each 
event in the natural and spiritual world, at least in principle, as 
reducible to prior states. But still we notice a fact, particularly in the 
most exact science, which, at least in this context, is most surprising. 
Present-day physics, as far as it is theoretically organized, is 
completely governed by a system of space-time differential 
equations which state that each process in nature is totally 
determined by the events which occur in its immediate temporal 
and spatial neighborhood. This entire rich system of differential 
equations, though they differ in detail, since they refer to 
mechanical, electric, magnetic, and thermal processes, is now 
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completely contained in a single dictum—the principle of least 
action. This, in short, states that, of all possible processes, the 

only ones that actually occur are those that involve minimum 
expenditure of action. As we can see, only a short step is required to 
recognize in the preference for the smallest quantity of action the 
ruling of divine reason, and thus to discover a part of Leibniz's 
teleological ordering of the universe? 

In present-day physics the principle of least action plays a 
relatively minor role. It does not quite fit into the framework of 
present theories. Of course, admittedly it is a correct statement; yet 
usually it serves not as the foundation of the theory, but as a true but 
dispensable appendix, because present theoretical physics is entirely 
tailored to the principle of infinitesimal local effects, and sees 
extensions to larger spaces and times as an unnecessary and 
uneconomical complication of the method of treatment. Hence, 
physics is inclined to view the principle of least action more as a 
formal and accidental curiosity than as a pillar of physical knowledge. 


"There is much more to the story of the least action principle, which we 
will revisit in Section 4.1. 


5For more information and history, consult S. Hildebrandt and A. J. Tromba, The Parsimonious Universe: Shape 
and Form in the Natural World, Springer-Verlag, New York/Berlin, 1995. 


Maxima and Minima for Functions of n- Variables 


As the previous remarks show, for Leibniz, Euler, and Maupertuis, and for much 
of modern science as well, all in nature is a consequence of some maximum or 
minimum principle. To make such grand schemes—as well as some that are more 
down to earth—effective, one must first learn the techniques of how to find maxima 
and minima of functions of n variables. 


Extreme Points 


Among the most basic geometric features of the graph of a function are its extreme 
points, at which the function attains its greatest and least values. In this section, we 
derive a method for determining these points. In fact, the method locates local extrema 
as well. These are points at which the function attains a maximum or minimum value 
relative only to nearby points. Let us begin by defining our terms. 


DEFINITION If f: U C R” > R is a given scalar function, a point xo € U is 
called a local minimum of f if there is a neighborhood V of xo such that for all points 
x in V, f(x) > f(xo). (See Figure 3.3.3.) Similarly, xo € U is a local maximum 
if there is a neighborhood V of xo such that f(x) < f(xo) for all x € V. The point 
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Graph of f 


— 


Graph of f 


1 
1 
e 
Xo 


Local minimum Local maximum 


(a) (b) 


Figure 3.3.3 (a) Local minimum and (b) local maximum points for a function of 
two variables. 


Xo € U is said to be a local, or relative, extremum if it is either a local minimum or 
a local maximum. A point Xo is a critical point of f if either f is not differentiable 
at Xo, or if itis, Df(xo) — 0. A critical point that is not a local extremum is called a 
saddle point. 


First Derivative Test for Local Extrema 


The location of extrema is based on the following fact, which should be familiar from 
one-variable calculus (the case n — 1): Every extremum is a critical point. 


THEOREM 4: First Derivative Test for Local Extrema If U c IR" 


is open, the function f: U C IR" — R is differentiable, and xo € U is a local 
extremum, then Df (xo) = 0; that is, xo is a critical point of f. 


PROOF Suppose that f achieves a local maximum at x9. Then for any h € IR", 
the function g(t) = f(xo + th) has a local maximum at ¢ = 0. Thus, from one- 
variable calculus g'(0) = 0.7 On the other hand, by the chain rule, 


8'(0) = [Df (xo)]h. 


Thus, [D/(xo)]h = 0 for every h, and so Df(xo) = 0. The case in which f achieves 
a local minimum at xo is entirely analogous. E 


©The term “saddle point” is sometimes not used this generally; we shall discuss saddle points further in the subsequent 
development. 

"Recall the proof from one-variable calculus: Because g(0) is a local maximum, g(t) < g(0) for small ¢ > 0, so g(t) — 
&(0) < 0, and hence g'(0) = limit (gr) — g(0))/t < 0, where limit means the limit as £ — 0, £ > 0. For small £ < 0, we 


similarly have g'(0) = limit (et). — g(0))/t > 0. Therefore, g’ So) = 
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If we remember that Df(xo) = 0 means that all the components of Df(xo) are 
Zero, we can rephrase the result of Theorem 4: If xo is a local extremum, then 


a 
M o =0, i= ln 
Ox; 


that is, each partial derivative is zero at xo. In other words, Vf (xo) = 0, where Vf is 
the gradient of f. 

If we seek to find the extrema or local extrema of a function, then Theorem 4 
states that we should look among the critical points. Sometimes these can be tested 
by inspection, but usually we use tests (to be developed below) analogous to the 
second-derivative test in one-variable calculus. 


[EXAMPLE 1| Find the maxima and minima of the function f: R? — R, defined 
by 


y f(x, y) =x? + y’. (Ignore the fact that this example can be done by inspection.) 


SOLUTION We first identify the critical points of f by solving the two equations 
af/ax = 0 and df/dy = 0, for x and y. But 


a a 
af L2x and ar =2y, 

ax ay 
so the only critical point is the origin (0, 0), where the value of the function is zero. 
Because f(x, y) > 0, this point is a relative minimum—in fact, an absolute, or global, 
minimum—of f. Because (0, 0) is the only critical point, there are no maxima. A 


'D.7. 209! Consider the function f: R? > R, (x,y) x? — y?. Ignoring 
for the moment that this function has a saddle and no extrema, apply the method 
of Theorem 4 for the location of extrema. 


SOLUTION As in Example 1, we find that f has only one critical point, at the 
origin, and the value of f there is zero. Examining values of f directly for points near 
the origin, we see that f(x, 0) > f(0, 0)and /(0, y) < f(0, 0), with strict inequalities 
when x # Oand y ¥ 0. Because x or y can be taken arbitrarily small, the origin cannot 
be either a relative minimum or a relative maximum (so it is a saddle point). Therefore, 
this function can have no relative extrema (see Figure 3.3.4). A 


EXAMPI Find all the critical points of z = x?y + y?x. 
SOLUTION Differentiating, we obtain 


a a 
Z =2xy +), Z = 2xy +x. 
ax ay 


Equating the partial derivatives to zero yields 


2xy +y? =0, 2xy +x? =0. 
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Graph off 


Figure 3.3.4 A function of two variables 
with a saddle point. 


Subtracting, we obtain x? = y?. Thus, x = +y. Substituting x = +y in the first of 
the two preceding equations, we find that 


2’ +y 2 3? =0, 
so that y = 0 and thus x = 0. Ifx = —y, then 
-2yb +y? = -y? =0, 
so y = 0 and therefore x = 0. Hence, the only critical point is (0, 0). For x = y, 


z = 2x3, which is both positive and negative for x near zero. Thus, (0, 0) is not a 
relative extremum. A 


Solio") Refer to Figure 3.3.5, a computer-drawn graph of the function 
z = 2(x? + y?) eX, Where are the critical points? 


Figure 3.3.5 The volcano: z = 2(x? + y?) exp(—x? — y?). 
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SOLUTION Because z = 2(x? + y?)e-* — , we have 


a. A 2 
3 = Ax(e 7") 4 20? + De (2.x) 
xX 


=e [dx — Ax? + y?)] 
= 4x(e (1 — x? — y?) 
and 


az 
= = Aye "Yl — x? - y). 
ay 
These both vanish when x = y = 0 or when x? + y? = 1. This is consistent with the 
figure: Points on the crater’s rim are maxima and the origin is a minimum. A 


Second Derivative Test for Local Extrema 


The remainder of this section is devoted to deriving a criterion, depending on the 
second derivative, for a critical point to be a relative extremum. In the special case 
n = 1, our criterion will reduce to the familiar condition from one-variable calculus: 
f" (xo) > 0 for a minimum and /"(xo) < 0 for a maximum. But in the general con- 
text, the second derivative is a fairly complicated mathematical object. To state our 
criterion, we will introduce a version of the second derivative called the Hessian, 
which in turn is related to quadratic functions. Quadratic functions are functions 
g: R” — R that have the form 


gis. ssl = > aijhih; 
ijs 


for an x n matrix [a;;]. In terms of matrix multiplication, we can write 


au dp cc Is hi 
ghi, ha) = [hu] 
Ani Am cto Am hn 
For example, if n = 3, 
1 -1 0 hy 
gi, ho, h3) = hî — 2h hia + h$ = [hy h2 h3] =l 0 0 ho 
0 € 1 h3 


is a quadratic function. 
We can, if we wish, assume that [a;;] is symmetric; in fact, g is unchanged if 
we replace [a;;] by the symmetric matrix [b;;], where bj; = i + aji), because 
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hih; = h;h; and the sum is over all i and j. The quadratic nature of g is reflected in 
the identity 


gi, AR) = gifs An), 


which follows from the definition. 
Now we are ready to define Hessian functions (named after Ludwig Otto Hesse, 
who introduced them in 1844). 


DEFINITION Suppose that f: U C IR" — R has second-order continuous deriva- 
tives (0?f/ Ax; 0x;)(xo). for i, j = 1, ..., n, ata point xo € U. The Hessian of f at 
Xo is the quadratic function defined by 


Hf) 1 yo 4 agua 
0 E "A. aL UE j 
2 A>, dx; Ax; d 
vf af 

í Ox, Ox, Ox, Xn hi 

= zhi -> An) : : 

af f hy 

Ox, OX] Ox, 0x, 


Notice that, by equality of mixed partials, the second derivative matrix is 
symmetric. 

This function is usually used at critical points xo € U. In this case, D/(xo) = 0, 
so the Taylor formula (see Theorem 2, Section 3.2) may be written in the form 


(xo +h) = f(xo) + H f (xoY(h) + R»(xo, h). 


Thus, at a critical point the Hessian equals the first nonconstant term in the Taylor 
series of f. 

A quadratic function g: IR" > R is called positive-definite if g(h) > 0 for all 
h € R" and g(h) — 0 only for h — 0. Similarly, g is negative-definite if g(h) « 0 
and g(h) = 0 for h = 0 only. Note that if n = 1, Hf (xo) = 4 f”(xo)h?, which is 
positive-definite if and only if f"(xo) > 0. 


THEOREM 5: Second Derivative Test for Local Extrema If 
f: U CR" > Ris of class C?, xo € U isa critical point of f, and the Hessian 


H f (xo) is positive-definite, then xo is a relative minimum of f. Similarly, if 
H f (xo) is negative-definite, then xo is a relative maximum. 
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Actually, we shall prove that the extrema given by this criterion are strict. A 
relative maximum xo is said to be strict if f(x) < f(xo) for nearby x Æ xo. A strict 
relative minimum is defined similarly. Also, the theorem is valid even if f is only C? 
but we have assumed C? for simplicity. 

The proof of Theorem 5 requires Taylor's theorem and the following result from 
linear algebra. 


LEMMA 1 If B = [b;j] is an n x n real matrix, and if the associated quadratic 
function 


1 n 
H: R” —> R, (Ay, ..., hn) > 3 J bijhihj 
ij=l 


is positive-definite, then there is a constant M > 0 such that for all h € R”, 
H(h) > M|hļ}?. 


PROOF For ||h|| = 1, set g(h) = H(h). Then g is a continuous function of h for 
||h|| = 1 and so achieves a minimum value, say M.9 Because H is quadratic, we have 


h h h 
H(h) = H H hj? = ( ) hl? > Ml? 
(h) (q^!) (a)! lt =g Tm li^ > Mihi 


for any h Æ 0. (The result is obviously valid if h = 0). m 


Note that the quadratic function associated with the symmetric matrix 
1(0?f/àx; x;) is exactly the Hessian. 


PROOF OF THEOREM 5 Recall that if f: U C IR" > R is of class C? and 
xo € U is a critical point, Taylor's theorem may be expressed in the form 


f (Xo +h) — f(xo) = Hf(xo)(h) + R2(x0, h), 


where (R2(xo, h))/||h||? > 0 as h — 0. 
Because H/f(xo) is positive-definite, lemma 1 assures us of a constant M > 0 
such that for all h c R" 


Hf (xo)(h) > MIhll". 


* Here we are using, without proof, a theorem analogous to a theorem in calculus that states that every continuous function 
on an interval [a, b] achieves a maximum and a minimum; see Theorem 7. 
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Because R2(xo, h)/||h||? — 0 as h — 0, there isa 6 > 0 such that for 0 < ||h|| < à 
|Ro(xo, h)| < MIIhl?. 


Thus, 0 < Hf(xo)(h) + R2(xo, h) = f(xo + h) — f(xo) for 0 < ||h|| < ô, so that xo 
is a relative minimum, in fact, a strict relative minimum. 

The proof in the negative-definite case is similar, or else follows by applying the 
preceding to — f, and is left as an exercise. iE 
Consider again the function f: R? > R, (x, y) œ> x? + y?. Then 
(0, 0) is a critical point, and f is already in the form of Taylor's theorem: 


SCO, 0) + (hi, ha) = f(O, 0) + (hi + 15) + 0. 
We can see directly that the Hessian at (0, 0) is 
Hf(0)(h) = h? +h, 
which is clearly positive-definite. Thus, (0, 0) is a relative minimum. This simple case 
can, of course, be done without calculus. Indeed, it is clear that f(x, y) > 0 for all 


(x,y) # (0,0. 4 


For functions of two variables f(x, y), the Hessian may be written as follows: 


AA 
E ax? dy Ox hy 
Hf(x, y)(h) = 5Uti, h2] a af hy | 
axdy dy? 


Now we shall give a useful criterion for when a quadratic function defined by such a 
2 x 2 matrix is positive-definite. This will be useful in conjunction with Theorem 5. 


LEMMA 2 Let 


a b hi 
s-[ A and um = yan 


Then H(h) is positive-definite if and only if a > 0 and det B = ac — b? > 0. 
PROOF We have 


ah, + bh; 


H(h) = 3[/i, h2] b Bus 


| = lah] + 2bhyh2 + ch?). 


3.3 Extrema of Real- Valued Functions 215 


Let us complete the square, writing 
2 2 
H(h) = ja(m +m) ESL 
a 2 a 


Suppose H is positive-definite. Setting h2 = 0, we see that a > 0. Setting hı = 
—(b/a)h2, we get c — b? /a > 0 orac — D? > 0. Conversely, ifa > Oandc —b?/a > 0, 
H(h) is a sum of squares, so that H(h) > 0. If H(h) = 0, then each square must 
be zero. This implies that both hı and A; must be zero, so that H(h) is positive- 
definite. m 


Similarly, one can see that H(h) is negative-definite if and only if a <0 and 
ac — D? > 0. We note that an alternative formulation is that H(h) is positive- (respec- 
tively, negative-) definite if a + c = trace B > 0 (respectively, < 0) and det B > 0. 


Determinant Test for Positive Definiteness 


There are similar criteria for an n x n symmetric matrix B. Consider the n square 
submatrices along the diagonal (see Figure 3.3.6). B is positive-definite (that is, the 
quadratic function associated with B is positive-definite) if and only if the determi- 
nants of these diagonal submatrices are all greater than zero. For negative-definite B, 
the signs should be alternately <0 and > 0. We shall not prove this general case here.? 
In case the determinants of the diagonal submatrices are all nonzero, but the matrix 
is not positive- or negative-definite, the critical point is of saddle type; in this case, 
one can show that the point is neither a maximum nor a minimum in the manner of 
Example 2. 


n 


Figure 3.3.6 "Diagonal" submatrices are used in the criterion for 
positive definiteness; they must all have determinant 0. 


?This is proved in, for example, K. Hoffman and R. Kunze, Linear Algebra, Prentice Hall, Englewood Cliffs, N.J., 1961, 
pp. 249—251. For students with sufficient background in linear algebra, it should be noted that B is positive-definite when 
all of its eigenvalues (which are necessarily real, because B is symmetric) are positive. 
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Second Derivative Test 


Lemma 2 and Theorem 5 imply the following result: 


THEOREM 6: Second Derivative Maximum-Minimum Test 
for Functions of Two Variables Let f(x, y) be of class C? on an open 
set U in R?. A point (xo, yo) is a (strict) local minimum of f provided the 
following three conditions hold: 


O Ze.) = 26) =0 
x ay 


a2 
(i) X00, 90) > 0 


" Pf) (PS af y 
Gin B= (5.5) (5) - Caray) meom 


(D is called the discriminant of the Hessian.) If in (ii) we have <0 instead of 
> 0 and condition (iii) is unchanged, then we have a (strict) local maximum. 


5 7120917] Classify the critical points of the function f: R? — R, defined by 
(x, y) e x? — 2xy +2y?. 


SOLUTION As in Example 5, we find that /(0, 0) = 0, the origin is the only 
critical point, and the Hessian is 


Hf(0)h) = h? — 2hiha + 252 = (hı — hay. + hå, 
1 2 


which is clearly positive-definite. Thus, f has a relative minimum at (0, 0). Al- 
ternatively, we can apply Theorem 6. At (0,0), 02//8x? = 2, 9?f/8y? = 4, and 
8?f/8x ay = —2. Conditions (i), (ii), and (iii) hold, so f has a relative minimum 
at(0,0) 4 


If D « 0 in Theorem 6, then we have a saddle point. In fact, one can prove that 
f(x, y) is larger than f'(xo, yo) as we move away from (xo, yo) in some direction and 
smaller in the orthogonal direction (see Exercise 26). The general appearance is thus 
similar to that shown in Figure 3.3.4. The appearance of the graph near (xo, yo) in the 
case D — 0 must be determined by further analysis. 

We summarize the procedure for dealing with functions of two variables: After 
all critical points have been found and their associated Hessians computed, some of 
these Hessians may be positive-definite, indicating relative minima; some may be 
negative-definite, indicating relative maxima; and some may be neither positive- nor 
negative-definite, indicating saddle points. The shape of the graph at a saddle point 
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where D < 0 is like that in Figure 3.3.4. Critical points for which D z 0 are called 
nondegenerate critical points. Such points are maxima, minima, or saddle points. 
The remaining critical points, where D — 0, may be tested directly, with level sets 
and sections or by some other method. Such critical points are said to be degenerate; 
the methods developed in this chapter fail to provide a picture of the behavior of a 
function near such points, so we examine them case by case. 


DENIA] Locate the relative maxima, minima, and saddle points of the 
function 


f(x. y) = log (x? 4- y? 4- 1). 


SOLUTION We must first locate the critical points of this function; therefore, 
according to Theorem 3, we calculate 


2x TI 2y X 
1 . 
xxyrl1 xryal 


VfG y)- 


Thus, Vf(x, y) = 0 if and only if (x, y) = (0, 0), and so the only critical point of f is 
(0, 0). Now we must determine whether this is a maximum, a minimum, or a saddle 
point. The second partial derivatives are 


9f — 2? +y? + 1) - Qx)Qx) 


ax? (x? +y? + 1)? J 
8f _ 2x? + y? +1) - Qy)Q») 
ay? — (x? + y? + 1)? , 
and 
Vf o — —2x(2y) 
ax dy (x2 + y? +1)?” 
Therefore, 
af af ar 
—(0, 0) =2 = — (0,0 d = —(0,0)=0, 
3x2) Bi sd Ve) 
which yields 


D=2-2=4>0. 


Because (87f/8x?)(0, 0) > 0, we conclude by Theorem 6 that (0, 0) is a local min- 
imum. (Can you show this just from the fact that log ¢ is an increasing function of 
t>0?) A 
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DESIG) The graph of the function g(x, y) = 1/xy is a surface S in IR?. 
Find the points on S that are closest to the origin (0, 0, 0). (See Figure 3.3.7.) 


Figure 3.3.7 The surface z = 1/xy in the 
first quadrant. (There are similar figures in 
the other quadrants, but notice that z < 0 
in the second and fourth quadrants.) 


SOLUTION Each point on S is of the form (x, y, 1/xy). The distance from this 
point to the origin is 


dx, y) 2 x? +y + 4. 
xy 
It is easier to work with the square of d, so let f(x, y) = x? + y? + (1/x?y?), which 
will have the same minimum point. Notice that f(x, y) becomes very large as x and 
y get larger and larger; f(x, y) also becomes very large as (x, y) approaches the x 
or y axis where f is not defined, so f must attain a minimum at some critical point. 
The critical points are determined by: 


of — " 2 o 
ax — x3y2 
af 2 

ay — d yix? ? 


that is, x*y? — 1 = 0, and x?y* — 1 = 0. From the first equation we get y? = 1/x^, 
and, substituting this into the second equation, we obtain 
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Thus, x 1 and y 1, and it therefore follows that f has four critical points, 
namely, (1, 1), (1, —1), (—1, 1), and (—1, —1). Note that f has the value 3 for all 
these points, so they are all minima. Therefore, the points on the surface closest to 
the point (0, 0, 0) are (1, 1, 1), (1, —1, —1), (—1, 1, —1), and (—1, —1, 1) and the 
minimum distance is 4/3. Is this consistent with the graph in Figure 3.3.7? A 


DOWIE) Analyze the behavior ofz = x°y + xy? + xy at its critical points. 
SOLUTION The first partial derivatives are 

az à 5 4 LM 

ap 9r Y+ +y = yx ty +1) 


and 


a 
oe x(5y* 4- x* +1). 
ay 
The terms 5x^ + y^ + 1 and Sy* + x^ + 1 are always greater than or equal to 1, and 
so it follows that the only critical point is (0, 0). 
The second partial derivatives are 


az 3 8?z 3 
ac » [T e 
and 
az 
= 5x44 5y +1. 
<< o 


Thus, at (0, 0), D = —1, and so (0, 0) is a nondegenerate saddle point and the graph 
of z near (0, 0) looks like the graph in Figure 3.3.4. A 


Global Maxima and Minima 


We end this section with a discussion of the theory of absolute, or global, maxima 
and minima of functions of several variables. Unfortunately, the location of absolute 
maxima and minima for functions on R” is, in general, a more difficult problem than 
for functions of one variable. 


DEFINITION Suppose f: A > Risa function defined on a set A in R? or R?. A 
point xo € A is said to be an absolute maximum (or absolute minimum) point of f 


if f(x) < f(xo) [or f(x) = f(xo)] for all x € A. 


In one-variable calculus, one learns—but often does not prove—that every con- 
tinuous function on a closed interval / assumes its absolute maximum (or minimum) 
value at some point xo in 7. A generalization of this theoretical fact also holds in R”. 
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Such theorems guarantee that the maxima or minima one is seeking actually exist; 
therefore, the search for them is not in vain. 


DEFINITION A set D € R" is said to be bounded if there is a number M > 0 
such that ||x|| < M for all x € D. A set is closed if it contains all its boundary points. 


Thus, a set is bounded if it can be strictly contained in some (large) ball. The 
appropriate generalization of the one-variable theorem on maxima and minima is the 
following result, stated without proof. 


THEOREM 7: Global Existence Theorem for Maxima and Minima 
Let D be closed and bounded in R” and let f: D — R be continuous. Then f assumes 
its absolute maximum and minimum values at some points xo and x, of D. 


Simply stated, xy and x; are points where f assumes its largest and smallest 
values. As in one-variable calculus, these points need not be uniquely determined. 

Suppose now that D = U U ðU, where U is open and dU is its boundary. If 
D CR’, we suppose that 3U is a piecewise smooth curve; that is, D is a region 
bounded by a collection of smooth curves—for example, a square or the sets depicted 
in Figure 3.3.8. 


Figure3.3.8 D = U U ðU: Two examples of regions whose boundary is a 
piecewise smooth curve. 


If xo and x, are in U, we know from Theorem 4 that they are critical points of 
f. If they are in 3U, and dU is a smooth curve (i.e., the image of a smooth path c 
with ec’ z 0), then they are maximum or minimum points of f viewed as a function 
on ðU. These observations provide a method of finding the absolute maximum and 
minimum values of f on a region D. 
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Strategy for Finding the Absolute Maxima and Minima on a Region 
with Boundary Let f be a continuous function of two variables defined on 
a closed and bounded region D in R?, which is bounded by a smooth closed 
curve. To find the absolute maximum and minimum of f on D: 


(i) Locate all critical points for f in U. 
(ii) Find the maximum and minimum of f viewed as a function only on dU. 
(iii) Compute the value of f at all of these critical points. 


(iv) Compare all these values and select the largest and the smallest. 


If D is a region bounded by a collection of smooth curves (such as a square), 
then one follows a similar procedure, but including in step (iii) the points where the 
curves meet (such as the corners of the square). 

All the steps except step (ii) should now be familiar to the student. To carry out 
step (ii) in the plane, one way is to find a smooth parametrization of 3U ; that is, we find 
a path c: Z — dU, where / is some interval, which is onto 3U. Second, we consider 
the function of one variable t +> /f(c(t)), where t € I, and locate the maximum and 
minimum points £j, t; € / (remember to check the endpoints!). Then c(fo), c(t;) will 
be maximum and minimum points for f as a function on dU. Another method for 
dealing with step (ii) is the Lagrange multiplier method, to be presented in the next 
section. 


DGG Soe] Find the maximum and minimum values of the function 
f(x,y) =x? - y? — x — y + l in the disk D defined by x? + y? < 1. 
SOLUTION (i) To find the critical points we set 3f /3x = 0f/dy = 0. Thus, 2x — 
1 — 0, 2y — 1 = 0, and hence (x, y) = G, 3) is the only critical point in the open 
disk U = ((x, y) | x + y? < 1). 

(ii) The boundary dU can be parametrized by e(t) = (sint, cost), 0 < t < 2x. 
Thus, 


f(c(t)) = sin? t + cos? t — sint — cost + 1 = 2 — sint — cost = g(t). 


To find the maximum and minimum of f on aU, it suffices to locate the maximum 
and minimum of g. Now g'(t) = 0 only when 


, ; -T 5 
sint = cost, that is, when t = —, —. 
4 4 


Thus, the candidates for the maximum and minimum for f on ðU are the points 
(2/4), e(57 /4) and the endpoints c(0) = c(27). 
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(iii) The values of f at the critical points are: (1, $) = 1 from step (i) and, from 


step (ii), 
NE-A) 
4) C89)» 
and 


f(«(0)) = f(eQx)) = f(0, 1) = 1. 


(iv) Comparing all the values i 2 — 42, 2 + V2, 1, itis clear that the absolute mini- 
mum occurs at (1/2, 1/2) and the absolute maximum occurs at (—4/2/2, —/2/2). A 


In Section 3.4, we shall consider a generalization of the strategy for finding the 
absolute maximum and minimum to regions D in IR". 


EXERCISES 


In Exercises 1 to 16, find the critical points of the given function and then determine whether 
they are local maxima, local minima, or saddle points. 


l fœ,y)=x -y +xy 


2. f(x, y) 2 x* y! — xy 


3. f(x, y) =x? + y?  2xy 

4 f@ y) ox +y? + 3xy 

5. f(x, y) 2 e 

6. f(t. y) =x? - 3xy + 5x -2y + 6y? +8 

7. f(x y) S 3X2 E 2xy 2x - y! c y £4 

8. f(x, y) = sin (x? + y?) [consider only the critical point (0, 0)] 


9. f(x, y) = cos (x? + y?) [consider only the three critical points (0, 0), (4/772, /7/2), 
and (0, /7)] 


10. f(x, y) 2 y - xsiny 
1. f(x,y) =e* cosy 


12. f(x, y) (x — yay - 1) 


AE 
13. f( y)=xy +-+- 
fœ, y») EIS 
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14. f(x, y) 2 logQ + sinxy) 
15. f(x,y) =xsiny 


16. fG y) =(@+y)ay +1) 


17. Find the local maxima and minima for z = (x? + 3y2) el? ". (See Figure 2.1.15.) 
18. Let f(x, y) =x? + y? + kxy. If you imagine the graph changing as k increases, at what 
values of k does the shape of the graph change qualitatively? 


19. An examination of the function f: R? > R, (x, y) 9 (y — 3x*)(y — x?) will give an 
idea of the difficulty of finding conditions that guarantee that a critical point is a relative 
extremum when Theorem 6 fails.'° Show that 


(a) the origin is a critical point of f; 

(b) f has a relative minimum at (0, 0) on every straight line through (0, 0); that is, if 
g(t) = (at, bt), then f o g: R > R has a relative minimum at 0, for every choice of a and b; 

(c) the origin is not a relative minimum of /. 


20. Let f(x, y) = Ax? + E where A and E are constants. What are the critical points of f? 
Are they local maxima or local minima? 


21. Let f(x, y) =x? — 2xy + y?. Here D = 0. Can you say whether the critical points are 
local minima, local maxima, or saddle points? 


22. Find the point on the plane 2x — y 4- 2z — 20 nearest the origin. 


23. Show that a rectangular box of given volume has minimum surface area when the box is 
a cube. 


24. Show that the rectangular parallelepiped with fixed surface area and maximum volume is 
a cube. 


25. Write the number 120 as a sum of three numbers so that the sum of the products taken 
two at a time is a maximum. 


26. Show that if (xo, yo) is a critical point of a quadratic function f(x, y) and D < 0, then 
there are points (x, y) near (xo, yo) at which f(x, y) > f(xo, yo) and, similarly, points for 
which f(x, y) < f(xo, yo). 


27. Determine the nature of the critical points of the function 
fe yz) mx +y $2? xy. 


28. Let n be an integer greater than 2 and set f(x, y) = ax" + cy", where ac # 0. 
Determine the nature of the critical points of f. 


Y"This interesting phenomenon was first pointed out by the famous mathematician Giuseppe Peano (1858-1932). Another 
curious “pathology” is given in Exercise 41. 
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29. Determine the nature of the critical points of f(x, y) = x? + y? — 6xy + 6x + 3y. 


30. Find the absolute maximum and minimum values of the function f(x, y) = (x? + ?)* 
on the disk x? + y? < 1. (You do not have to use calculus.) 


31. Repeat Exercise 30 for the function f(x, y) =x? +xy + y’. 


32. Accurve C in space is defined implicitly on the cylinder x? + y? = 1 by the additional 
equation x? — xy + y? — z? = 1. Find the point or points on C closest to the origin. 


33. Find the absolute maximum and minimum values for f(x, y) = sinx + cos y on the 
rectangle R defined by 0 € x < 27,0 € y < 2x. 


34. Find the absolute maximum and minimum values for the function f(x, y) — xy on the 
rectangle R defined by -1 < x «1, 1 < y < 1. 


35. Determine the nature of the critical points of f(x, y) = xy + 1/x + 8/y. 


In Exercises 36 through 40, D denotes the unit disk. 


36. Letu be a C? function on D which is “strictly subharmonic”; that is, the following 
inequality holds: V?u = (3?u/3x?) + (82u/8y?) > 0. Show that u cannot have a maximum 
point in D\dD (the set of points in D, but not in 9D). 


37. Letu be a harmonic function on D; that is, V?u = 0 on DN8 D and be continuous on 
D. Show that if u achieves its maximum value in DVà D, it also achieves it on 0 D. This is 
sometimes called the “weak maximum principle" for harmonic functions. [HiNT: Consider 
V?(u + €e*), € > 0. You can use the following fact, which is proved in more advanced texts: 
Given a sequence {pq}, n = 1, 2, ..., of points in a closed bounded set A in IR? or R3, there 
exists a point q such that every neighborhood of q contains at least one member of (p, ).] 


38. Define the notion of a strict superharmonic function u on D by mimicking Exercise 36. 
Show that u cannot have a minimum in D\dD. 


39. Let u be harmonic in D as in Exercise 37. Show that if u achieves its minimum value in 
D\@D, it also achieves it on à D. This is sometimes called the “weak minimum principle" for 
harmonic functions. 


40. Let $: 3D — R be continuous and let T be a solution on D to V?T = 0, continuous on 
D and T = $ on 3D. 


(a) Use Exercises 36 to 39 to show that such a solution, if it exists, must be unique. 

(b) Suppose that 7(x, y) represents a temperature function that is independent of time, 
with ¢ representing the temperature of a circular plate at its boundary. Can you give a 
physical interpretation of the principle stated in part (a)? 


41. (a) Let f bea C! function on the real line IR. Suppose that f has exactly one critical 
point x, that is a strict local minimum for f. Show that xo is also an absolute minimum for f, 
that is, that f(x) > f(xo) for all x. 
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(b) The next example shows that the conclusion of part (a) does not hold for functions 
of more than one variable. Let f: R? — R be defined by 


f(x, y)2 65 — e x 2yMes t e. 


(i) Show that (0, 0) is the only critical point for f and that it is a local minimum. 
(ii) Argue informally that f has no absolute minimum. 


42. Suppose that a pentagon is composed of a rectangle topped by an isosceles triangle (see 
Figure 3.3.9). If the length of the perimeter is fixed, find the maximum possible area. 


Figure 3.3.9 Maximize the area for fixed perimeter. 


y 


3.4 Constrained Extrema and Lagrange Multipliers 


Often one is required to maximize or minimize a function subject to certain constraints 
or side conditions. For example, we might need to maximize f(x, y) subject to the 
condition that x? + y? = 1; that is, that (x, y) lie on the unit circle. More generally, 
we might need to maximize or minimize f(x, y) subject to the side condition that 
(x, y) also satisfies an equation g(x, y) — c where g is some function and c equals a 
constant [in the preceding example, g(x, y) = x? + y?, and c = 1]. The set of such 
(x, y) is a level curve for g. 

The purpose of this section is to develop some methods for handling this sort 
of problem. In Figure 3.4.1 we picture a graph of a function f(x, y). In this picture, 


z= f(x, y) subject 
to the constraint . " : 
34y-l Figure 3.4.1 The geometric meaning of 


maximizing f subject to the constraint 
x+y =l. 


Point on x? +y? = 1 where fis maximized 
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the maximum of f might be at (0, 0). However, suppose we are not interested in 
this maximum but only the maximum of f(x, y) when (x, y) belongs to the unit 
circle; that is, when x? + y? = 1. The cylinder over x? + y? = 1 intersects the graph 
of z = f(x, y) in a curve that lies on this graph. The problem of maximizing or 
minimizing f(x, y) subject to the constraint x? + y? = 1 amounts to finding the 
point on this curve where z is the greatest or the least. 


The Lagrange Multiplier Method 


In general, let f: U C R” — Rand g: U C R” > R be given C! functions, and let 
S be the level set for g with value c [recall that this is the set of points x € IR" with 
g(x) — c]. 

When f is restricted to $ we again have the notion of local maxima or local 
minima of f (local extrema), and an absolute maximum (largest value) or absolute 
minimum (smallest value) must be a local extremum. The following method provides 
a necessary condition for a constrained extremum: 


THEOREM 8: The Method of Lagrange Multipliers Suppose that 
f: U CR" > Rand g: U C IR" — Rare given C! real-valued functions. Let 
xo € U and g(xo) — c, and let S be the level set for g with value c (recall that 
this is the set of points x € R” satisfying g(x) = c). Assume Vg(xo) Æ 0. 

If f|S, which denotes “ f restricted to $,” has a local maximum or minimum 
on S at xo, then there is a real number A such that 


Vf (xo) = AVg(xo). (1) 


In general, a point xo where equation (1) holds is said to be a critical point 
of f'|S. 


PROOF We have not developed enough techniques to give a complete proof, but 
wecan provide the essential points. (The additional technicalities needed are discussed 
in Section 3.5 and in the Internet supplement.) 

In Section 2.6 we learned that for n = 3 the tangent space or tangent plane of S 
at Xo is the space orthogonal to Ve(xo). For arbitrary n we can give the same definition 
for the tangent space of S at xo. This definition can be motivated by considering 
tangents to paths e(ż) that lie in S, as follows: If e(ż) is a path in S and c(0) = xo, then 
€'(0) is a tangent vector to S at xo, but 


d d 
Fe) = c = 0, 
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and on the other hand, by the chain rule, 
(c(t) = Vg(xo)-¢ 0), 
7 g g(Xo) ( 


so that Ve(xo) - €'(0) = 0; that is, €'(0) is orthogonal to Vg(xo). 
If f|S has a maximum at xo, then f(c(t)) has a maximum at t = 0. By one- 
variable calculus, d f (c(t))/dt|,-o = 0. Hence, by the chain rule, 


d ; 
0— 4/0), , = FG) e(). 


Thus, V/ (xo) is perpendicular to the tangent of every curve in $ and so is perpendicular 
to the whole tangent space to S at xo. Because the space perpendicular to this tangent 
space is a line, Vf (xo) and Vg(xo) are parallel. Because Vg(xo) Æ 0, it follows that 
Vf(xo) is a multiple of Vg(xo), which is the conclusion of the theorem. m 


Let us extract some geometry from this proof. 


THEOREM 9 If f, when constrained to a surface S, has a maximum or minimum 
at xo, then Vf (xo) is perpendicular to S at xo (see Figure 3.4.2). 


z Brad Axo Yo, zo) = VAXo Yo zo) 


Plane tangent to S 


yes 


Surface S 


Figure 3.4.2 The geometry of constrained 
extrema. 


These results tell us that in order to find the constrained extrema of f, we must 
look among those points xo satisfying the conclusions of these two theorems. We 
shall give several illustrations of how to use each. 

When the method of Theorem 8 is used, we look for a point xo and a constant 
A, called a Lagrange multiplier, such that Vf (xo) = AVg(xo). This method is more 
analytic in nature than the geometric method of Theorem 9. Surprisingly, Euler in- 
troduced these multipliers in 1744, some 40 years before Lagrange! 
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Equation (1) says that the partial derivatives of f are proportional to those of g. 
Finding such points xo at which this occurs means solving the simultaneous equations 


9 9, 
B aiiis =r Q, vey %n) 
xi Ox) 
a 9, 
aLQu a) = AE st) 
: (2) 
af ag 
Gig, bise ls nad] 
SC, 0+ x4) He 


for xi, ..., x, and A. 
Another way of looking at these equations is as follows: Think of A as an addi- 
tional variable and form the auxiliary function 


hlær -<-s Xn A) = fas... Xn) — Aleli - -s Xn) — c]. 


The Lagrange multiplier theorem says that to find the extreme points of f|S, we 
should examine the critical points of /. These are found by solving the equations 


m ah Of ag 
~ ax; Oxy Ox) 


_ oh of 7 dg 
(0x,  OXn OXn 


which are the same as equations (2) above. 

Second derivative tests for maxima and minima analogous to those in Section 3.3 
will be given in Theorem 10 later in this section. However, in many problems it is pos- 
sible to distinguish between maxima and minima by direct observation or by geometric 
means. Because this is often simpler, we consider examples of the latter type first. 


| EXAMPLE I| Let S C R? be the line through (—1, 0) inclined at 45°, and let 
f: R? —> R, (x, y) eo x? + y?. Find the extrema of fS. 


SOLUTION Here S = (x, y) | y ^x — 1 = 0}, and therefore we set g(x, y) = 
y—x — l and c = 0. We have Vg(x, y) = —i + j z 0. The relative extrema of /|S 
must be found among the points at which Vf is orthogonal to S, that is, inclined at 
—45*. But Vf(x, y) = (2x, 2y), which has the desired slope only when x = —y, or 
when (x, y) lies on the line L through the origin inclined at —45°. This can occur 
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in the set S only for the single point at which L and S intersect (see Figure 3.4.3). 
Reference to the level curves of f indicates that this point, (— 1/2, 1/2), is a relative 
minimum of fS (but not of f). 

Notice that in this problem, f on S has a minimum but no maximum. A 


a Level set f= | 


Figure 3.4.3 The geometry associated with 
finding the extrema of f(x, y) = x? + y? 


Levels Ed restricted to S = ((x, y) | y — x — 1 = 0]. 


Let f: R? > R, (x, y) 9 x? — y?, and let S be the circle of 
radius | around the origin. Find the extrema of fS. 


SOLUTION The set S is the level curve for g with value 1, where g: IR? > R, 
(x, y) e» x? + y?. Because both of these functions have been studied in previous 
examples, we know their level curves; these are shown in Figure 3.4.4. In two dimen- 
sions, the condition that Vf = AVg at xo, that is, that Vf and Vg are parallel at xo is 
the same as the level curves being tangent at xo (why?). Thus, the extreme points of 
fS are (0, +1) and (+1, 0). Evaluating f, we find (0, +1) are minima and (+1, 0) 
are maxima. 


Figure 3.4.4 The geometry associated with the problem of 
finding the extrema of x? — y? on S = ((x, y) | x? +y? « 1]. 
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Let us also do this problem analytically by the method of Lagrange multipliers. 
Clearly, 


af of 


=, 2) —(2x,—2y) and Vg(x, y) = (2x, 2y). 
ax’ dy 


vps. = ( 


Note that Ve(x, y) Æ 0 if x? + y? = 1. Thus, according to the Lagrange multiplier 
theorem, we must find a à such that 


Qx,—2y)2AQx,2y) and  (y)eS, ie, x?+y?=1. 


These conditions yield three equations, which can be solved for the three un- 
knowns x, y, and A. From 2x = A2x we conclude that either x = O or à = 1. Ifx = 0, 
then y = +1 and —2y = A2y implies A 1. If A= I, then y 2 0 and x = +1. 
Thus, we get the points (0, +1) and (+1, 0), as before. As we have mentioned, this 
method only locates potential extrema; whether they are maxima, minima, or nei- 
ther must be determined by other means, such as geometric arguments or the second 
derivative test given below.!! — A 


[EXAMPLE 3] Maximize the function f(x, y, z) — x 4- z subjectto the constraint 


X^4 y^ dz. 


SOLUTION By Theorem 7 we know that the function f restricted to the unit 
sphere x? + y? +z? = 1 has a maximum (and also a minimum). To find the maximum, 
we again use the Lagrange multiplier theorem. We seek À and (x, y, z) such that 


] 22x23, 0-2), and 1=2z,, 
and 
x+y +l. 


From the first or the third equation, we see that à zz 0. Thus, from the second equation, 
we get y — 0. From the first and third equations, x — z, and so from the fourth, 
x = £1/4/2 =z. Hence, our points are (1/4/2, 0, 1/42) and (-1/4/2, 0, —1/4/2). 
Comparing the values of f at these points, we can see that the first point yields the 
maximum of f (restricted to the constraint) and the second the minimum. A 


| DSW BE] Assume that among all rectangular boxes with fixed surface area 
of 10 square meters there is a box of largest possible volume. Find its dimensions. 


SOLUTION If x, y, andz are the lengths ofthe sides, x > 0, y > 0, z > 0, respec- 
tively, and the volume is f(x, y, z) = xyz. The constraint is 2(xy + xz + yz) = 10; 


!!In these examples, Vg(xo) # 0 on the surface S, as required by the Lagrange multiplier theorem. If Vg(xo) were zero 
for some xo on 5, then it would have to be included among the possible extrema. 


3.4 Constrained Extrema and Lagrange Multipliers 231 


that is, xy + xz + yz = 5. Thus, the Lagrange multiplier conditions are 


yz — Xy +2) 
xz = A(x +z) 
xy =My +x) 


xy +xz +yz =5. 


First ofall, x 4 0, because x = 0 implies yz = 5 and 0 = Az, so that A = 0 and we get 
the contradictory equation yz = 0. Similarly, y # 0, z # 0, x + y Æ 0. Elimination 
of A from the first two equations gives yz/( y + z) = xz/(x + z), which gives x = y; 
similarly, y = z. Substituting these values into the last equation, we obtain 3x? = 5, 
orx = 5/3. Thus, we get the solution x = y = z = V553, and xyz = (5/3%%. 
This (cubical) shape must therefore maximize the volume, assuming there is a box of 
maximum volume. A 


Existence of Solutions 


We should note that the solution to Example 4 does not demonstrate that the cube is 
the rectangular box of largest volume with a given fixed surface area; it proves that 
the cube is the only possible candidate for a maximum. We shall sketch a proof that 
it really is the maximum later. The distinction between showing that there is only one 
possible solution to a problem and that, in fact, a solution exists is a subtle one that 
many (even great) mathematicians have overlooked. 

Queen Dido (ca. 900 B.C.) realized that among all planar regions with fixed 
circumference, the disc is the region of maximum area. It is not difficult to prove 
this fact under the assumption that there is a region of maximum area; however, 
proving that such a region of maximum area exists is quite another (difficult) matter. 
A complete proof was not given until the second half of the nineteenth century by the 
German mathematician Weierstrass. 

Let us consider a nonmathematical parallel to this situation. Put yourself in the 
place of Lord Peter Wimsey, Dorothy Sayers’ famous detective: 


“Undoubtedly,” said Wimsey, “but if you think that this identification is 
going to make life one grand, sweet song for you, you are mistaken. . . . 
Since we have devoted a great deal of time and thought to the case on the 
assumption that it was murder, it's a convenience to know that the 
assumption is correct." 


Wimsey has found the body of a dead man, and after some time has located 
ten suspects. He is sure that no one else other than one of the suspects could be the 
murderer. By collecting all the evidence and checking alibis, he then reduces the 
number of suspects one by one, until, finally, only the butler remains; hence he is 
the murderer! But wait, Peter is a very cautious man. By checking everything once 
again, he discovers that the man died by suicide; so there is no murder. You see the 
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point: It does not suffice to find a clear and uniquely determined suspect in a criminal 
case where murder is suspected; you must prove that a murder actually took place. 

The same goes for our cube; the fact that it is the only possible candidate for a 
maximum does not prove that it is maximum. (For more information see The Parsi- 
monious Universe: Shape and Form in the Natural World, by S. Hildebrandt and A. 
Tromba, Springer-Verlag, New York/Berlin, 1995.) 

The key to showing that f(x, y, z) — xyz really has a maximum lies in the fact 
that f is a continuous function that is defined on the unbounded surface S: xy + xz + 
yz = 5, and not on a bounded set, which includes its boundary, where Theorem 7 of 
Section 3.3 would apply. We have already seen problems of this sort for functions of 
one and two variables. 

The way to show that f(x, y, z) — xyz > 0 does indeed have a maximum on 
Xy + yz + xz = 5 is to show that if either x, y, or z tend to co, then f(x, y, z) > 0. 
We may then conclude that the maximum of f on S must exist by appealing to 
Theorem 7 (the student should supply the details). So, suppose (x, y, z) lies in S and 
x — oo, then y > 0 and z > 0 (why?). Multiplying the equation defining S by z 
we obtain the equation x yz + xz? + yz = 5z — Oasx — oo. Because x, y, z > 0, 
xyz = f(x, y, z) > 0. Similarly, f(x, y, z) > 0 if either y or z tend to oo. Thus, a 
box of maximum volume must exist. 

Some general guidelines may be useful for maximum and minimum problems 
with constraints. First of all, if the surface S is bounded (as an ellipsoid is, for ex- 
ample), then f must have a maximum and a minimum on S. (See Theorem 7 in the 
preceding section.) In particular, if f has only two points satisfying the conditions 
of the Lagrange multiplier theorems or Theorem 9, then one must be a maximum 
and one must be a minimum. Evaluating f at each point will tell the maximum from 
the minimum. However, if there are more than two such points, some can be saddle 
points. Also, if S is not bounded (for example, if it is a hyperboloid), then f need not 
have any maxima or minima. 


Several Constraints 


Ifa surface S is defined by a number of constraints, namely, 


gi +--+, Xn) = C1 
22(%1, - -+ Xn) = C2 
: ; (4) 


gas... Xn) = Ck 


then the Lagrange multiplier theorem may be generalized as follows: /f f has a 
maximum or a minimum at xo on S, there must exist constants A1, . . . , A4 such that? 


Vf (xo) = 21Vgi(Xo) + - - - + 24 Vgi(Xo). (5) 


"As with the hypothesis Vg(xo) #0 in the Lagrange multiplier theorem, here one must assume that the vectors 
Vgi(xo). ---, Vge(Xo) are linearly independent; that is, each Vg; (xo) is nota linear combination of the other Vg, (xo), j i. 
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This case may be proved by generalizing the method used to prove the Lagrange 
multiplier theorem. Let us give an example of how this more general formulation is 
used. 


DVB Find the extreme points of f(x, y, z) =x + y + z subject to the 
two conditions x? + y? = 2 and x +z = 1. 


SOLUTION Here there are two constraints: 
gx, y, z) —x4?-2-20 and g(x, y,z) =x+z—-1=0. 
Thus, we must find x, y, z, à1, and A? such that 


Vf (x, y, z) = 31VgiGx, y, z) + A2 Vgo(x, y, Z), 
gi(x, y. z) — 0, and gx(x, y, z) — 0. 


Computing the gradients and equating components, we get 


1 224-2x 4-321, 
] 224*:2y 25-0, 
1=A,-0+A2°1, 

x? 4.57 = 2, and bes 


These are five equations for x, y, Z, A1, and Az. From the third equation, 4? = 1, and 
so 2xA, = 0, 2yA, = 1. Because the second implies A; 4 0, we have x = 0. Thus, 
y = xA/2 and z = 1. Hence, the possible extrema are (0, +V/2, 1). By inspection, 
(0, V2, 1) gives a relative maximum, and (0, —4/2, 1) a relative minimum. 

The condition x? + y? = 2 implies that x and y must be bounded. The condition 
x +z = l implies that z is also bounded. If follows that the constraint set S is closed 
and bounded. By Theorem 7 it follows that f has a maximum and minimum on S 
that must therefore occur at (0, 4/2, 1) and (0, —/2, 1), respectively. A 


The method of Lagrange multipliers provides us with another tool to locate the 
absolute maxima and minima of differentiable functions on bounded regions in IR? 
(see the strategy for finding absolute maximum and minimum in Section 3.3). 


[D.C WIESE Find the absolute maximum of f(x, y) = xy on the unit disk D, 
where D is the set of points (x, y) with x? + y? < 1. 


SOLUTION By Theorem 7 of Section 3.3, we know the absolute maximum 
exists. First, we find all the critical points of f in U, the set of points (x, y) with 
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x? +y < 1. Because 


(0, 0) is the only critical point of f in U. Now consider f on the unit circle, the level 
curve g(x, y) = 1, where g(x, y) = x? + y?. To locate the maximum and minimum 
of f on C, we write down the Lagrange multiplier equations: Vf (x, y) = (y, x) = 
ANg(x, y) = A(2x, 2y) and x? + y? = 1. Rewriting these in component form, we get 


y222x; 
x —2Ay, 
x +y =1 
Thus, 
y=4y, 


or à = «1/2 and y = +x, which means that x? +x? 22x? = 1 or x = 1/42, 
y = +1//2. On C we compute four candidates for the absolute maximum and min- 
imum, namely, 


1 1 1 1 1 1 1 1 
( J2' a) ( J2 A) v a) (s a) 
The value of f at both (—1/./2, —1/4/2) and (1/42, 1/4/2) is 1/2. The value of f 
at (—1/42, 1/4/2) and (1//2, —1/4/2) is —1/2, and the value of f at (0, 0) is 0. 
Therefore, the absolute maximum of f is 1/2 and the absolute minimum is —1/2, 
both occurring on C. At (0, 0), 3?f/3x? = 0, 89?//8y? = 0 and 3?f/ðx dy = 1, so the 
discriminant is — 1 and thus (0, 0) is a saddle point. A 


EXAMPLE 7| Find the absolute maximum and minimum of f(x, y) — 3x? + 
iy? in the elliptical region D defined by 1x? + y? < 1. 


SOLUTION Again by Theorem 7, Section 3.3, the absolute maximum exists. We 
first locate the critical points of f in U, the set of points (x, y) with $x? + y? < 1. 
Because 


af af 
= =x Icy 
ox oy 
the only critical point is the origin (0, 0). 

We now find the maximum and minimum of f on C, the boundary of U, which 
is the level curve g(x, y) = 1, where g(x, y) = ix? + y?. The Lagrange multiplier 
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equations are 


Vf, y) = x, y) = AVE, y) = A(x, 2y) 


and (x?/2) + y? = 1. In other words, 


x =x 
y=2dy 
2 
€ 2 
pa ex 
2ty 


If x = 0, then y 1 and à = 1. If y = 0, then x = 4/2 anda = 1. If x #0 and 
y #0, we get both A = 1 and 1/2, which is impossible. Thus, the candidates for 
the maxima and minima of f on C are (0, +1), (2-2, 0) and for f inside D, the 
candidate is (0, 0). The value of f at (0, +1) is 1/2, at (+./2, 0) it is 1, and at (0, 0) 
it is 0. Thus, the absolute minimum of f occurs at (0, 0) and is 0. The absolute 
maximum of f on D is thus 1 and occurs at the points (4-4/2,0). A 


Global Maxima and Minima 


The method of Lagrange multipliers enhances our techniques for finding global max- 
ima and minima. In this respect, the following is useful. 


DEFINITION Let U be an open region in R” with boundary dU. We say that 
aU is smooth if AU is the level set of a smooth function g whose gradient Vg never 
vanishes (i.e., Vg # 0). Then we have the following strategy. 


Lagrange Multiplier Strategy for Finding Absolute Maxima and 
Minima on Regions with Boundary Let f be a differentiable function on 
a closed and bounded region D — UUQU,U open in R”, with smooth 
boundary aU. 

To find the absolute maximum and minimum of f on D: 


(i) Locate all critical points of f in U. 


(ii) Use the method of Lagrange multiplier to locate all the critical points of 
f\au. 


(iii) Compute the values of f at all these critical points. 


(iv) Select the largest and the smallest. 


EXAMPLE 8| Find the absolute maximum and minimum of the function 
f(x, y,z) =x + y +z on the set D = {(x, y, z) | x? +y? +27 < 1}. 
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SOLUTION As in the previous examples, we know the absolute maximum and 
minimum exists. Now D = U U dU, where 


U = (xy z) |x? -y +2 <1} 


and 


aU = (6, y, z) |x? +y? +2 = M. 


The gradient of f is Vf = (1, 1, 1), andso f has no critical points in U. Therefore, 
the maximum and minimum values of f must occur on dU. 

Let g(x, y, z) = x? + y? + z?. Then 3U is the level set g(x, y, z) = 1. By the 
method of Lagrange multipliers, the maximum and minimum must occur at a critical 
point of f|ðU, that is, at a point xg where Vf (xo) = AV g(xo) for some scalar A. 

Thus, 


(1, 1, 1) = A(Q2x, 2y, 2z); that is, x ; y 


Because x? + y? +z? = 1, we obtain à = +V/3/2 and so xo = +(1/V3, 1/V3, 
1/4/3). Clearly, —(1/V3, 1/3, 1/4/3) is the point where f assumes its absolute 
minimum (namely, —/3) and (1/4/3, 1/4/3, 14/3), the point where f assumes its 
maximum value X3. A 


Two Additional Applications 


We now present two further applications of the mathematical techniques developed in 
this section to geometry and to economics. We shall begin wth a geometric example. 


AVIADA] Suppose we have a curve defined by the equation 
p(x, y) = Ax? + 2Bxy - Cy! — 1— 0. 
Find the maximum and minimum distance of the curve to the origin. (These are the 


lengths of the semimajor and the semiminor axis of this quadric.) 
SOLUTION The problem is equivalent to finding the extreme values of f(x, y) = 
x? + y? subject to the constraining condition $(x, y) = 0. Using the Lagrange mul- 
tiplier method, we have the following equations: 
2x + AQ Ax + 2By) 2 0 (6) 
2y + AQBx + 2Cy) 20 (7) 
Ax? + 2Bxy + Cy? =1. (8) 
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Adding x times equation (6) to y times equation (7), we obtain 
2(x? + y?) + 22 (Ax? + 2Bxy + Cy?) = 0. 


By equation (8), it follows that x? + y? +A = 0. Let t = —1/A = 1/(x? + y?) [the 
case A = 0 is impossible, because (0, 0) is not on the curve (x, y) = 0]. Then 
equations (6) and (7) can be written as follows: 


2(A — t +2By 20 


(9) 
2Bx +2(C — t)y =0. 
If these two equations are to have a nontrivial solution [remember that (x, y) — (0, 0) 
is not on our curve and so is not a solution], it follows from a theorem of linear algebra 
that their determinant vanishes:? 
A-t B 
| B C-t) 9 


Because this equation is quadratic in t, there are two solutions, which we shall call 
tı and fp. Because —A = x? + y?, we have /x? + y? = V/—4. Now yx? + y? is the 
distance from the point (x, y) to the origin. Therefore, if (x1, yı) and (x2, y2) denote 
the nontrivial solutions to equation (9) corresponding to ¢; and t», and if t; and t are 
positive, we get x3 + y? = 1//h and yx? + y? = 1/,/f. Consequently, if tj > t, 
the lengths of the semiminor and semimajor axes are 1/,/f, and 1/,/f, respectively. 
If the curve is an ellipse, both t; and t» are, in fact, real and positive. What happens 


with a hyperbola or a parabola? A 


Finally, we discuss an application to economics. 


Suppose that the output of a manufacturing firm is a quantity Q 
ofa certain product, where Q is a function f(K, L), where K is the amount of capital 
equipment (or investment) and L is the amount of labor used. If the price of labor is 
p, the price of capital is q, and the firm can spend no more than B dollars, how can 
we find the amount of capital and labor to maximize the output Q? 


SOLUTION We would expect that if the amount of capital or labor is increased, 
then the output Q should also increase; that is, 


a2 0 and EDT 


Q5 


“The matrix of coefficients of the equations cannot have an inverse, because this would imply that the solution is zero. 
Recall that a matrix that does not have an inverse has determinant zero. 
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We also expect that as more labor is added to a given amount of capital equipment, 
we get less additional output for our effort; that is, 


a?Q 
JL <0. 


Similarly, 


#0 


JK <0. 


With these assumptions on Q, it is reasonable to expect the level curves of out- 
put (called isoquants) Q(K, L) — c to look something like the curves sketched in 
Figure 3.4.5, with c1 < c» < cs. 


Figure 3.4.5 What is the largest 
value of Q in the shaded triangle? 


We can interpret the convexity of the isoquants as follows: As one moves to the 
right along a given isoquant, it takes more and more capital to replace a unit of labor 
and still produce the same output. The budget constraint means that we must stay 
inside the triangle bounded by the axes and the line pL +q K = B. Geometrically, it 
is clear that we produce the most by spending all our money in such a way as to pick 
the isoquant that just touches, but does not cross, the budget line. 

Because the maximum point lies on the boundary of our domain, we apply the 
method of Lagrange multipliers to find the maximum. To maximize Q — f(K, L) 
subject to the constraint pL + qK = B, we look for critical points of the auxiliary 
function, 


h(K,L,X)- f(K, L) - (pL +qK — B). 
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Thus, we want 
a 
sp M4: ay MD, and pL+qK =B. 


These are the conditions we must meet in order to maximize output. (The reader is 
asked to work out a specific case in Exercise 31.) A 


In the preceding example, A represents something interesting. Let k = qK and 
l = pL, so that k is the dollar value of the capital used and / is the dollar value of the 
labor used. Then the first two equations become 


80 100 , 130 _a0 
ak  qOK pL al” 


Thus, at the optimum production point the marginal change in output per dollar’s 
worth of additional capital investment is equal to the marginal change of output per 
dollar’s worth of additional labor, and A is this common value. At the optimum point, 
the exchange of a dollar's worth of capital for a dollar's worth of labor does not change 
the output. Away from the optimum point the marginal outputs are different, and one 
exchange or the other will increase the output. 


A Second Derivative Test for Constrained Extrema 


InSection3.3, we developed a second derivative test for extrema of functions of several 
variables by looking at the second-degree term in the Taylor series of f. If the Hessian 
matrix of second partial derivatives is either positive-definite or negative-definite at 
a critical point of f, this point is a relative minimum or maximum, respectively. 

The question naturally arises as to whether there is a second derivative test for 
maximum and minimum problems in the presence of constraints. The answer is yes 
and the test involves a matrix called a bordered Hessian. We will first discuss the test 
and how to apply it for the case of a function f(x, y) of two variables subject to the 
constraint g(x, y) — c. 


THEOREM 10 Let f: U c R? > Rand g: U C R? — R be smooth (at least 
C?) functions. Let vo € U, g(vo) = c, and S be the level curve for g with value c. 
Assume that Vg(vo) # 0 and that there is a real number A such that Vf (vo) = AVg(vo). 
Form the auxiliary function h = f — Ag and the bordered Hessian determinant 


evaluated at vo. 
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(i) If |A| > 0, then vo is a local maximum point for f |S. 
(ii) If |F] < 0, then vo is a local minimum point for f|S. 


(iii) If |H| — 0, the test is inconclusive and vo may be a minimum, a maximum, or 
neither. 


This theorem is proved in the Internet supplement for this section. 


|, WIESEN Find extreme points of f(x, y) = (x — y)" subject to the con- 


straint x? + y? = 1, where n > 1. 


SOLUTION We set the first derivatives of the auxiliary function h defined by 
h(x, y, A) = (x — y)" — A(x? + y? — 1) equal to 0: 


n(x — yy! —2ax 20 
—n(x — yy! —2ay = 0 
x? +y?—1)=0. 


From the first two equations we see that A(x + y) = 0.IfA = 0,thenx = y = +/2/2. 
If A Æ 0, then x = —y. The four critical points are represented in Figure 3.4.6 and 
the corresponding values of f(x, y) are listed below: 


(A) x=V2/2 | y=V/2/2 X-0 f@,y)=0 
(B x-242/2 y=—-V2/2 h=n(V 2"? Sey) =(V2" 
(O x-2-42/2 y--42/ x20 Sœ, y)=0 


(D x2-42/2 yz42/2  a=(-1} nV — fGy)-(-V2y. 


—PÀ x Figure 3.4.6 The four critical points in Example 11. 


By inspection, we see that if n is even, then A and C are minimum points and B 
and D are maxima. If 5 is odd, then B is a maximum point, D is a minimum, and A and 
C are neither. Let us see whether Theorem 10 is consistent with these observations. 
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The bordered Hessian determinant is 


0 —2x —2y 
|A| =|-2x  nn—1)(x—yy?-24 n(n — 1)(x — yy? 
—2y —n(n —1)(x — yy? n(n — 1)(x — yy"? — 2a 


= —4n(n — 1)(x — y)? (x + y 4-816? — y?). 


If n = 1 or if n > 3, |H| — 0 at A, B, C, and D. If n = 2, then |H| = 0 at B and D 
and —16 at A and C. Thus, the second-derivative test picks up the minima at A and C, 
but is inconclusive in testing the maxima at B and D for n = 2. It is also inconclusive 
for all other values of n. A 


Just as in the unconstrained case, there is also a second-derivative test for functions 
of more than two variables. If we are to find extreme points for f (x1, ... , Xn) subject 
to a single constraint g(x1,..., Xn) = c, we first form the bordered Hessian for the 


auxiliary function h(x), ..., Xn) = f(xi, -< <, Xa) — A(g(@1, ..., Xn) — c) as follows: 

0 —8g —üg 08, 
Ox) Ox2 OXn 
—ag 97h 9?h 97h 

Ox) ax? Ox, 0x2 0x1 0x, 
—dg ah ah ah 

Ox2 Ox) Ox2 əx? 0x»? 0X, 
—üg əh əh 8?h 
Ox,  Oxji0üx, 3X2 3Xn ax? 


Second, we examine the determinants of the diagonal submatrices of order >3 at the 
critical points of h. If they are all negative, that is, if 


0 ag ag ag 

0 _ 9g _ og 0x1 0x2 0x3 
x, 8x2 ag 8h 82h ah 

ag 8?h 8?h əxi 8x?  ðxıðxı  0xi0x 

ðxı xp — Ax ðx2 9 ag h 82h gy (Oen 

üg ah 9?h Ax.  0xj0x2 ax? 0x2 Ox; 
“x2 Axx — ax? ag ah ah 9?h 
üx;  Oxj0x3 9x2 Ax3 ax} 


then we are at a local minimum of f |S. If they start out with a positive 3 x 3 subde- 
terminant and alternate in sign (that is, >0, <0, >0, <0, ...), then we are at a local 
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maximum. If they are all nonzero and do not fit one of these patterns, then the point 
is neither a maximum nor a minimum (it is said to be of the saddle type).'^ 


DAVIDE] Study the local extreme points of f(x, y, z) = xyz on the surface 
of the unit sphere x? + y? +z? = 1 using the second-derivative test. 


SOLUTION Setting the partial derivatives of the auxiliary function 
h(x, y, z, À) = xyz — A(x? + y? + z? — 1) equal to zero gives 


yz =2dx 
xz =2dy 
xy =2dz 


xy 427 =1, 


Thus, 3xyz = 2A(x? + y? +z?) = 24. If A=0, the solutions are (x, y, z, 4) = 
(+1, 0, 0, 0), (0, +1, 0, 0), and (0, 0, +1, 0). If A # 0, then we have 2A = 3xyz = 
6Az? and so z? = i. Similarly, x? = y? = į. Thus, the solutions are given by 
A= ixyz = +,/3/6. The critical points of h and the corresponding values of f 
are given in Table 3.1. From it, we see that points E, F, G, and K are minima. Points 
D, H, I, and J are maxima. To see whether this is in accord with the second-derivative 


The critical points A, B, . . . , J, K of and corresponding values of f 


x y z A F(X; yz) 
+A +1 0 0 0 0 
+B 0 a 0 0 0 
SEC 0 0 +1 0 0 
D 43/3 43/3 43/3 43/6 43/9 
E -43/3 43/3 3/3 —/3/6 —/3/9 
F J3/3 —3/3 3/3 —4/3/6 —43/9 
G J/3/3 43/3 -v3/3 —J3/6 —V3/9 
H 43/3 —43/3 —43/3 43/6 3/9 
I -V3/3 43/3 —/3/3 3/6 43/9 
J —V/3/3 —v3/3 43/3 43/6 J3/9 
K —45/3 —453/3 —/3/3 —4/6 —J3/9 


V For a detailed discussion, see C. Caratheodory, Calculus of Variations and Partial Differential Equations, Holden-Day, 
San Francisco, 1965; Y. Murata, Mathematics for Stability and Optimization of Economic Systems, Academic Press, New 
York, 1977, pp. 263-271; or D. Spring, Am. Math. Mon. 92 (1985): 631-643. 
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test, we need to consider two determinants. First, we look at the following: 


0 —dg/dx | —0g/8y 0 —2x -—2y 
|H;|2|-8g/àx  8?h/8x?  8?/üx8y| =|—2x —24 z 
—dg/dy a*h/ax dy a7h/ay? —2y 2  —2À 


= 8Ax? + BAy? + Bxyz = BA(x? + y? + 227). 


Observe that sign (|/75]) = sign 4 = sign (xyz), where the sign of a number is | if 
that number is positive, or is —1 if that number is negative. Second, we consider 


0 dg/dx dg/dy dg/dz 
—dg/ax  a*h/ax* 8?h/dx dy 8*h/ax az 
—dg/dy Oh/OxOy  90?h/9y?  O?h/8y az 
—üg/óz QO?^h/ü0x80z a*h/dydz  O?hJoz 

0 -2x -2y —2z 
—2x —24 z y 
-2y 2 -2 x |’ 


which works out to be +4 at points +A, +B, and +C and -i at the other eight 
points. At E, F, G, and K, we have || < 0 and |H3| < 0, and so the test indicates 
these are local minima. At D, H, I, and J we have |H2| > 0 and |H3| < 0, and so the 
test says these are local maxima. Finally, the second-derivative test shows that +A, 
+B, and +C are saddle points. A 


EXERCISES 


In Exercises 1 to 5 find the extrema of f subject to the stated constraints. 
1. f(x, y, z) =x — y +z, subject tox? + y? +z? 22 

2. f(x, y) =x — y, subject to x? - y? = 2 

3. f(x, y) =x, subject to x? + 25? — 3 

4. f(x, yz) ox yz, subject tox? —y?=1,2x+z=1 

5. f(x, y) = 3x + 2y, subject to 2x? + 3? 23 


Find the relative extrema of f|S in Exercises 6 to 9. 


6 f: R — R, (x,y) x? y^, S= {(x,2)|x € R} 


7. f: RoR (x,y) x +y, S= (ey) ly z 2] 
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8. f: R? > R,(x,y)e x? y^, S = ((x, cosx) | x € R} 
9. f: R >R, œ, y z) x^ y£zSsS-l(Qyznlzz24x xy 


10. Use the method of Lagrange multipliers to find the absolute maximum and minimum 
values of f(x, y) = x? + y? — x — y + 1 on the unit disk (see Example 10 of Section 3.3). 


11. Consider the function f(x, y) = x? + xy + y? defined on the unit disk, namely, 

D = ((x, y) | x? + y? < 1). Use the method of Lagrange multipliers to locate the maximum 
and minimum points for f on the unit circle. Use this to determine the absolute maximum 
and minimum values for f on D. 


12. A rectangular box with no top is to have a surface area of 16 m°. Find the dimensions 
that maximize its volume. 


13. Design a cylindrical can (with a lid) to contain 1 liter (— 1000 cm?) of water, using the 
minimum amount of metal. 


14. Show that solutions of equations (4) and (5) are in one-to-one correspondence with the 
critical points of 


A(x,---5 Eas As Ag) = fas.) — Mg Gas x2) — 6i] 
—e Agent, --- x) — es]. 


15. Find the absolute maximum and minimum for the function f(x, y, z) =x + y — z on 
the ball B = ((x, y, z) | x? +y? +2? < 1). 


16. Repeat Exercise 15 for f(x, y, z) =x + yz. 


17. A rectangular mirror with area A square feet is to have trim along the edges. If the trim 
along the horizontal edges costs p cents per foot and that for the vertical edges costs q cents 
per foot, find the dimensions that will minimize the total cost. 


18. An irrigation canal in Arizona has concrete sides and bottom with trapezoidal cross 
section of area A = y(x + y tan 0) and wetted perimeter P = x + 2y/ cos, where x = 
bottom width, y = water depth, 0 = side inclination, measured from vertical. The best design 
for a fixed inclination 0 is found by solving P — minimum subject to the condition 

A = constant. Show that y? = (A cos0)/(2 — sin 0). 


19. Apply the second-derivative test to study the nature of the extrema in Exercises | and 5. 
20. A light ray travels from point A to point B crossing a boundary between two media (see 


Figure 3.4.7). In the first medium its speed is v;, and in the second it is vz. Show that the trip 
is made in minimum time when Snell ’s law holds: 


sin 6, Ui 


sin6; v? 
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+x  Figure3.4.7 Snell's law of refraction. 


21. A parcel delivery service requires that the dimensions of a rectangular box be such that 
the length plus twice the width plus twice the height be no more than 108 inches (/ + 2w+ 
2h < 108). What is the volume of the largest-volume box the company will deliver? 


22. Let P be a point on a surface S in IR? defined by the equation f(x, y, z) = 1, where f is 
of class C. Suppose that P is a point where the distance from the origin to S is maximized. 
Show that the vector emanating from the origin and ending at P is perpendicular to S. 


23. Let A be a nonzero symmetric 3 x 3 matrix. Thus, its entries satisfy a;; = a;;. Consider 
the function f(x) — 3(Ax) +x. 

(a) What is Vf? 

(b) Consider the restriction of f to the unit sphere S = {(x, y, z) | x? + y? +2? = 1} in 
IR3. By Theorem 7 we know that f must have a maximum and a minimum on S. Show that 
there must be an x € S and a A # 0 such that Ax = Ax. (The vector x is called an eigenvector, 
while the scalar A is called an eigenvalue.) 

(c) What are the maxima and minima for f on B = ((x, y. z) | x +y? +2? < 1? 


24. Suppose that A in the function f defined in Exercise 23 is not necessarily symmetric. 


(a) What is Vf? 
(b) Can one conclude the existence of an eigenvector and eigenvalues as in Exercise 23? 


25. (a) Find the critical points of x + y? subject to the constraint 2x? + y? = 1. 
(b) Use the bordered Hessian to classify the critical points. 


26. Answer the question posed in the last line of Example 9. 

27. Try to find the extrema of xy + yz among points satisfying xz = 1. 

28. A company’s production function is Q(x, y) = xy. The cost of production is C(x, y) = 
2x + 3y. If this company can spend C(x, y) = 10, what is the maximum quantity that can be 


produced? 


29. Find the point on the curve (cos t, sin £, sin(t/2)) that is farthest from the origin. 
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30. A firm uses wool and cotton fiber to produce cloth. The amount of cloth produced is 
given by Q(x, y) 2 xy — x — y + 1, where x is the number of pounds of wool, y the number 
of pounds of cotton, x > 1, and y > 1. If wool costs p dollars per pound, and cotton q dollars 
per pound and the firm can spend B dollars on material, what should the ratio of cotton and 
wool be to produce the most cloth? 


31. Carry out the analysis of Example 10 for the production function Q(K, L) = AK*L!^*, 
where A and o are positive constants and 0 < œ < 1. This is called a Cobb-Douglas 
production function and is sometimes used as a simple model for the national economy. 

Q is then the aggregate output of the economy for a given input of capital and labor. 


3.5 The Implicit Function Theorem 


In this section, we state two versions of the implicit function theorem, arguably the 
most important theorem in all of mathematical analysis. The entire theoretical basis 
of the idea of a surface as well as the method of Lagrange multipliers depends on 
it. Moreover, it is a cornerstone of several fields of mathematics, such as differential 


topology and geometry. 


The One-Variable Implicit Function Theorem 


In one-variable calculus, we learn the importance of the inversion process. For exam- 
ple, x = In y is the inverse of y = e*, and x = sin“! y is the inverse of y = sin x. The 
inversion process is also important for functions of several variables; for example, the 
switch between Cartesian and polar coordinates in the plane involves inverting two 
functions of two variables. 

Recall from one-variable calculus that if y = f(x) isa C! function and f’(xo) 40, 
then locally near xo we can solve for x to give the inverse function: x = f~'(y). We 
learn that (f^ )'(y) = 1/f"(x); that is, dx/dy = 1/(dy/dx). That y = f(x) can be 
inverted is plausible because f’(xo) # 0 means that the slope of y = f(x) is nonzero, 
so that the graph is rising or falling near xo. Thus, if we reflect the graph across the line 
y =X, itis still a graph near (xo, yo) where yo = f(xo). For example, in Figure 3.5.1, 
we can invert y = f(x) in the shaded box, so in this range, x = f~'(y) is defined. 


Figure 3.5.1 If f'(xo) Æ 0, then 


= is i ibl 
Paveni = YCoyis locally invertible. 


------- near (Xo, y) 
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A Special Result 


We next turn to the situation for real-valued functions of variables x;, ..., x, and z. 


THEOREM 11: Special Implicit Function Theorem Suppose that 
F: R'*! — R has continuous partial derivatives. Denoting points in IR"*! by (x, z), 
where x € IR" and z € R, assume that (xo, zo) satisfies 


OF 
F(Xo,zo)— 0 and 3; 09 zo) #0. 
zZ 


Then there is a ball U containing xo in R” and a neighborhood V of zo in R such that 
there is a unique function z = g(x) defined for x in U and z in V that satisfies 

F(x, g(x)) = 0. 
Moreover, if x in U and z in V satisfy F(x, z) — 0, thenz — g(x). Finally, z — g(x) 


is continuously differentiable, with the derivative given by 


1 
De(x) = m D, F(x, z) " 
=~, 2) z-g() 
az 


where D, F denotes the (partial) derivative of F with respect to the variable x, that 
is, we have D,F = [9 F/0xi, ..., 0F/0x,]; in other words, 


ag 9F/0x; 
ax; OF /dz’ 


BE, cist (1) 


A proof of this theorem is given in the Internet supplement. 

Once it is known that z — g(x) exists and is differentiable, formula (1) may be 
checked by implicit differentiation; to see this, note that the chain rule applied to 
F(x, g(x)) = 0 gives 


D,F(x, g(x) + E «e| [Dg(x)] = 0, 


which is equivalent to formula (1). 


EXAMPLE 1| In the special implicit function theorem, it is important to recog- 
nize the necessity of taking sufficiently small neighborhoods U and V. For example, 
consider the equation 


x? +z2—1=0, 


that is, F(x, z) = x? +z? — 1, with n = 1. Here (8F/8z)(x, z) = 2z, and so the spe- 
cial implicit function theorem applies to a point (xo, Zo) satisfying xo + z -1=0 
and zo z 0. Thus, near such points, z is a unique function of x. This function is 
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z= /1— x? ifzo > Oandz = —V/1 — x? ifzo < 0. Note that z is defined for |x| < 1 
only (U must not be too big) and z is unique only if it is near zo (V must not be too 
big). These facts and the nonexistence of 0z/x at zo = 0 are, of course, clear from 
the fact that x? + z? = 1 defines a circle in the xz plane (Figure 3.5.2). A 


Figure 3.5.2 It is necessary to take small 
neighborhoods in the implicit function 
theorem. 


The Implicit Function Theorem and Surfaces 


Let us apply Theorem 11 to the study of surfaces. We are concerned with the level 
set of a function g: U C IR" — R, that is, with the surface S consisting of the set of 
x satisfying g(x) = co, where co = g(xo) and where xo is given. Let us take n = 3 
for concreteness. Thus, we are dealing with the level surface of a function g(x, y, z) 
through a given point (xo, yo, zo). As in the Lagrange multiplier theorem, assume 
that Vg(xo, yo, zo) # 0. This means that at least one of the partial derivatives of 
g is nonzero. For definiteness, suppose that (dg/0z)(xo, yo, zo) # 0. By applying 
Theorem 11 to the function (x, y, z) œ> g(x, y, z) — co, we know there is a unique 
function z = k(x, y) satisfying g(x, y, k(x, y)) = co for (x, y) near (xo, yo) and z near 
Zo. Thus, near Zo the surface S is the graph ofthe function k. Because k is continuously 
differentiable, this surface has a tangent plane at (xo, yo, zo) given by 


ðk ðk 
z — zo (Xo, yo) |(x — xo) + | z— Go. yo) |Gv — yo). (2) 
ax ay 
But by formula (1), 
8 ag 
ak SE Qro» Jo, 20) ak ay 00 Yo, Zo) 
ax (x0, yo) = ag and 3 (Xo, yo) ag : 
= (x0, Yo, Zo) d 7 (xo. Yo, Zo) 
Oz Oz 
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Substituting these two equations into the equation for the tangent plane gives this 
equivalent description: 


9, 9, a 
0 = — zo) Go. yo, 20) + (x — xo) == Go. Yo. zo) + (Y — 36) (xo Yo» Zo); 
z ax oy 


that is, 
(x — xo, y — yo, Z — 20) + Vg(xo, yo. zo) = 0. 


Thus, the tangent plane to the level surface of g is the orthogonal complement to 
Vg(xo, yo, zo) through the point (xo, yo, zo). This agrees with our characterization of 
tangent planes to level sets from Chapter 2. 

We are now ready to complete the proof of the Lagrange multiplier theorem. To 
do this, we must show that every vector tangent to S at (xo, yo, zo) is tangent to a curve 
in S. By Theorem 11, we need only show this for a graph of the form z — K(x, y). 
However, if v = (x — xo, y — yo, Z — zo) is tangent to the graph [that is, if it satisfies 
equation (2)], then v is tangent to the path in S given by 


c(t) = (xo + t(x — xo), yo + ty — yo), K(xo + t(x — xo), yo + t(y — yo))) 


at t = 0. This can be checked by using the chain rule. (See Figure 3.5.3.) 


(xo Jo» 2) S: z= k(x, y) 
\ pe 


H 


The path e(/) Figure 3.5.3 The construction of 
a path e(t) in the surface S whose 
tangent vector is v. 


The line (x9+ t(x — xo), Yo + (v — yo) 


DGB Near what points may the surface 


x? + 3y? + 8x2? - 32y =1 


be represented as a graph of a differentiable function z = k(x, y)? 
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SOLUTION Here we take F(x, y, z) =x? --3y? + 8xz? — 3z°y — 1 and at- 
tempt to solve F(x, y, z) — 0 for z as a function of (x, y). By Theorem 11, this 
may be done near a point (xo, yo, zo) if (8 F/8z)(xo, yo. zo) Æ 0, that is, if 

zo(16xo = 9zoyo) Fx 0, 


which means, in turn, 


zo #0 and 16x9 Æ 9Zoyo. A 


General Implicit Function Theorem 


Next we shall state, without proof, the general implicit function theorem.'? Instead of 
attempting to solve one equation for one variable, we attempt to solve m equations 
for m variables z4, ..., Zm: 


Fixi, Xn Zis.. Zu) —0 
FQ, <e- Xn, Zis- --s Zm) = 0 


Q) 
Fy(xi, Xni Ziya ss Zm) = 0. 


In Theorem 11 we had the condition à F/8z 4 0. The condition appropriate to the 
general implicit function theorem is that A 0,6 where A is the determinant of the 
m x m matrix 


aF anm 
dz) OZm 
OF n OF n 
dz) OZm 


'5For three different proofs of the general case, consult: 


(a) E. Goursat, A Course in Mathematical Analysis, 1, Dover, New York, 1959, p. 45. (This proof derives the general 
theorem by successive application of Theorem 11.) 


(b) T.M. Apostol, Mathematical Analysis, 2d ed., Addison-Wesley, Reading, Mass., 1974. 
(c) J. E. Marsden and M. Hoffman, Elementary Classical Analysis, 2d ed., Freeman, New York, 1993. 


Of these sources, the last two use more sophisticated ideas that are usually not covered until a junior-level course in 
analysis. The first, however, is easily understood by the reader who has some knowledge of linear algebra. 


'©For students who have had linear algebra: The condition A 5 0 has a simple interpretation in the case that F is linear; 
namely, A # 0 is equivalent to the rank of F being equal to m, which in turn is equivalent to the fact that the solution 
space of F = 0 is m-dimensional. 
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evaluated at the point (xo, zo); in the neighborhood of such a point, we can uniquely 
solve for z in terms of x. 


THEOREM 12: General Implicit Function Theorem If ^ ¥ 0, then 


near the point (Xo, zo), equation (3) defines unique (smooth) functions 
z; = ki(xi, a xn) (i — 3 um). 


Their derivatives may be computed by implicit differentiation. 


DABE] Show that near the point (x. y, u, v) = (1, 1, 1, 1), we can solve 


xu 4 yw? =2 


xi) 4 y^ 22 


uniquely for and v as functions of x and y. Compute du/0x at the point (1, 1). 


SOLUTION To check solvability, we form the equations 


Fi(x, y, u, v) = xu + yvu? —2 
Fy(x,y,u,v)— xi 4 yh —2 


and the determinant 


OF, 9F 

A= i ke at (LLLI) 
2 2 
ðu ðv 
x+2yuv yu? 

=| wu gyp] * CLLD 
3 1 

=h 4l=9 


Because A Æ 0, solvability is assured by the general implicit function theorem. To 
find du/dx, we implicitly differentiate the given equations in x using the chain 
rule: 

ðu 


ðv 5 
2 E 
ax + UU 0 


ð ð 
3n 24 + i + ayo Z ~ 
Ox ax 
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Setting (x, y, u, v) = (1, 1, 1, 1) gives 


ðu av 
3—+—=-1 
dx Bx 
ðu ðv 
3— +4— = -l1 
ax iu ax 


Solving for du/dx by multiplying the first equation by 4 and subtracting gives 
du/dx = -i A 


Inverse Function Theorem 


A special case ofthe general implicit function theorem is the inverse function theorem. 
Here we attempt to solve the n equations 


AiG, +++. Xn) =V 
Ue @) 
Sar, ++ -3 Xn) = yn 
for x1, ..., x, as functions of yi, ..., Yn; that is, we are trying to invert the equa- 


tions of system (4). This is analogous to forming the inverses of functions like 
sin x = y and e* = y, with which the reader should be familiar from elementary 
calculus. Now, however, we are concerned with functions of several variables. The 
question of solvability is answered by the general implicit function theorem applied 


to the functions y; — f;i(xi, ..., Xn) with the unknowns x), ... , x, (called zi, ..., Zn 
earlier). The condition for solvability in a neighborhood of a point xo is ^ # 0, where 
A is the determinant of the matrix Df (xo), and f = (fi, ..., fa). The quantity A is 


denoted by (fi, .... f,)/8(i. -+ -p Xn), OF IOP, «+ +5 Yn)/B(1y --- Xn) or JC Qo) 
and is called the Jacobian determinant of f. Explicitly, 


afi afi 
ax 09 cU aL 00 
TEE JC) : : i 
Clos teers.) X—Xo 9f, ah 
ax, (X0) +++ 8x, (xo) 


The reader should note that in the case when f is linear, for example f(x) = Ax, 
where A is an n x n matrix, the condition A Æ 0 is equivalent to the fact that the 
determinant of A, det A 4 0, and from Section 1.5 we know that A, and therefore f, 
has an inverse. 

The Jacobian determinant will play an important role in our work on integration 
(see Chapter 5). The following theorem summarizes this discussion: 
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THEOREM 13: Inverse Function Theorem Let U C R” be open and let 
fi: U>R,..., fa: U — R have continuous partial derivatives. Consider the equa- 
tions (4) near a given solution xo, yo. If J ( f )(xo) [defined by equation (5)] is nonzero, 
then equation (4) can be solved uniquely as x = g(y) for x near xo and y near yo. 
Moreover, the function g has continuous partial derivatives. 


»CWIIE)ES Consider the equations 


=u, sinx + cos y = v. 


Near which points (x, y) can we solve for x, y in terms of u, v? 


SOLUTION Here the functions are u = fi(x, y) = (xt + y^)/x and v= 
a(x, y) = sinx + cos y. We want to know the points near which we can solve for x, y 
as functions ofu and v. According to the inverse function theorem, we must first com- 
pute the Jacobian determinant 9( fi, f2)/8(x, y). We take the domain of f = (fi, f2) 
to be U = ((x, y) € R2 | x # 0}. Now 


Sf Bh 3ax^—y* 4y? 
86h.) |8x ay} |—-— z siny, 4 44 4y° 
——— = = x x = —()" — 3x") — — cosx. 
a@.y) ah ah © el ae 
ax ay cosx —sin y 


Therefore, at points where this does not vanish we can solve for x, y in terms of u 
and v. In other words, we can solve for x, y near those x, y for which x 4 0 and 
(sin y)(y^ — 3x*) 4 4xy? cos x. Such conditions generally cannot be solved explic- 
itly. For example, if xo = 2/2, yo = 2/2, we can solve for x, y near (xo, yo) because 
there, 9(fi, f2)/9(x, y) #0. A 


EXERCISES 


1. Let F(x, y) = 0 define a curve in the xy plane through the point (xo, yo), where F is C. 
Assume that (3 F/8y) (xo, yo) # 0. Show that this curve can be locally represented by the 
graph of a function y = g(x). Show that (i) the line orthogonal to VF (xo, yo) agrees with 
(ii) the tangent line to the graph of y = g(x). 


2. Show that xy + z + 3xz? = 4 is solvable for z as a function of (x, y) near (1, 0, 1). 
Compute 8z/8x and 8z/8y at (1, 0). 


3. (a) Check directly (i.e., without using Theorem 11) where we can solve the equation 
F(x, y) = y? + y - 3x + 1 = 0 for y in terms of x. 
(b) Check that your answer in part (a) agrees with the answer you expect from the 
implicit function theorem. Compute dy/dx. 
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4. Repeat Exercise 3 with F(x, y) = xy? — 2y - x? +2 — 0. 


5. Show that x?z? — z?yx = 0 is solvable for z as a function of (x, y) near (1, 1, 1), but not 
near the origin. Compute 0z/dx and 8z/8y at (1, 1). 


6. Discuss the solvability in the system 


3x+2y+2+u+v=0 
4x+3y+z+w+u+w+2=0 


x+z+wtw?+2=0 


for u, v, w in terms of x, y, z near x = y=z=0,u=v=0, and v = —2. 
7. Discuss the solvability of 
ytx+uv=0 
uxy+v=0 
for u, v in terms of x, y near x = y = u = v = 0 and check directly. 
8. Investigate whether or not the system 
u(x, y, Z) = x + xyz 
vx, y, z) = y xy 
w(x, y, z) =z + 2x 432" 
can be solved for x, y, z in terms of u, v, w near (x, y, z) = (0, 0, 0). 


9. Consider f(x, y) = ((x? — y*)/(x? + °), xy/(x? + y?)). Does this map of IR2W(0, 0) to 
TR? have a local inverse near (x, y) — (0, 1)? 


10. (a) Define x: R? > R by x(r, 0) =r cos and define y: R? > R by y(r, 6) =r sind. 
Show that 


a(x, y) _ 
87, 0) lg, ai) 


To. 


(b) When can we form a smooth inverse function (r(x, y), 0(x, y))? Check directly and 
with the inverse function theorem. 
(c) Consider the following transformations for spherical coordinates (see Section 1.4): 
x(p, 0,0) = psinó cos 
x(p, $, 0) = psing sind 
2(p, $, 0) = pcos. 
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Show that the Jacobian determinant is given by 


a(x, y, z) 
9(p, $, 0) 


(d) When can we solve for (p, $, 0) in terms of (x, y, z)? 


= p? sin. 


11. Let (xo, yo, zo) be a point of the locus defined by z? + xy — a = 0, z? +x? — 
y? — b — 0, where a and b are constants. 


(a) Under what conditions may the part of the locus near (xo, yo, zo) be represented in 
the form x = f(z), y = g(z)? 
(b) Compute /’(z) and g'(z). 


12. Is it possible to solve the system of equations 
xy! 4 xzu 4 yv? =3 
wyz +2xv w^? —2 
for u(x, y, Z), v(x, y, Z) near (x, y, z) = (1, 1, 1), (u, v) = (1, 1)? Compute ðv/ðy at 


(x,y,z) = (4 1, 1). 


13. The problem of factoring a polynomial x" + a,—\x"~! + - - - + ao into linear factors is, in 
a sense, an "inverse function" problem. The coefficients a; may be thought of as functions of 
the 7 roots rj. We would like to find the roots as functions of the coefficients in some region. 
With n — 3, apply the inverse function theorem to this problem and state what it tells you 
about the possibility of doing this. 


REVIEW EXERCISES FOR CHAPTER 3 


1. Analyze the behavior of the following functions at the indicated points. [Your answer in 
part (b) may depend on the constant C.] 


(a) z— x? — y? + 3xy, (x, y) = (0, 0) 
(b z2x'—- y! Cxy,  (x,y)= (0,0) 


2. Find and classify the extreme values (if any) of the functions on IR? defined by the 
following expressions: 


(a) yr- (b) & — 1? +e- y? (c) x? xy x y* 


3. (a) Find the minimum distance from the origin in R? to the surface z = /x? — 1. 
(b) Repeat part (a) for the surface z = 6xy + 7. 


4. Find the first few terms in the Taylor expansion of f(x, y) — e"' cosx about x — 0, 
y=0. 
5. Prove that 


3x4 — 4x3 — 12x? + 18 
z= iO 
12(1 +47?) 
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has one local maximum, one local minimum, and one saddle point. (The graph is shown in 
Figure 3.R.1.) 


Figure 3.R.1 Graph of z = 
(3x4 — 4x3 — 12x? + 18)/12(1 + 452). 


6. Find the maxima, minima, and saddles of the function z = (2 + cos zx )(sin zt y), which 
is graphed in Figure 3.R.2. 


Figure 3.R.2 Graph of z = 
(2 + cos x) (sin zy). 


Figure 3.R.3 Graph of z = y sin(zx). 
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8. A graph of the function z = sin(zx)/(1 + y?) is shown in Figure 3.R.4. Verify that this 
function has alternating maxima and minima on the x axis, with no other critical points. 


Figure3.R.4 Graph of 
z = sin(zx)/(1 + y?). 


In Exercises 9 to 14 find the extrema of the given functions subject to the given constraints. 
9. f(x, y) =x? — 2xy + 25, subject to x? + y? = 1 
10. f(x, y) = xy — y^, subject to x? + y? = 1 


11. fŒ, y) = cos(x? — y?), subject to x? + y? = 1 
2 


12. f(x. y)-7 Es bjectto x +y = 1 
. Sx, y= , Subject to x = 
Y= 2 ye J y 


13. z — xy, subject to the condition x + y = 1. 
14. z = cos? x + cos? y, subject to the condition x + y = 7/4. 
15. Find the points on the surface z? — xy — 1 nearest to the origin. 
16. Use the implicit function theorem to compute dy/dx for 
(a) x/y — 10 (b) x? — siny + y^ =4 (c) e^? +y3=0 


17. Find the shortest distance from the point (0, 5) to the parabola x? — 4y — 0. Solve this 
problem using the Lagrange multiplier method and also without using Lagrange's method. 


18. Solve the following geometric problems by Lagrange's method. 


(a) Find the shortest distance from the point (ai, a2, a3) in IR? to the plane whose 
equation is given by bx, + b2x2 + b3x3 + bo = 0, where (bi, b2, b3) Æ (0, 0, 0). 

(b) Find the point on the line of intersection of the two planes a,x) + a2x2 + a3x3 = 0 
and b,x; + box» + b3x3 + bo = 0 that is nearest to the origin. 

(c) Show that the volume of the largest rectangular parallelepiped that can be inscribed 
in the ellipsoid 


x 
reg ee eed 
a 


is 8abc/3V3. 
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19. A particle moves in a potential V(x, y) = x? — y? +x? + 3xy. Determine whether (0, 0) 
is a stable equilibrium point—that is, whether or not (0, 0) is a strict local minimum of V. 


20. Study the nature of the function f(x, y) = x? — 3xy? near (0, 0). Show that the point 
(0, 0) is a degenerate critical point, that is, D — 0. This surface is called a monkey saddle. 


21. Find the maximum of f(x, y) = xy on the curve (x + 1? + y? = 1. 
22. Find the maximum and minimum of f(x, y) = xy — y +x — 1 on the set x? + y? < 2. 
23. The Baraboo, Wisconsin, plant of International Widget Co., Inc., uses aluminium, 
iron, and magnesium to produce high-quality widgets. The quantity of widgets that may 
be produced using x tons of aluminum, y tons of iron, and z tons of magnesium is 
Q(x, y, z) = xyz. The cost of raw materials is aluminum, $6 per ton; iron, $4 per ton; and 
magnesium, $8 per ton. How many tons each of aluminum, iron, and magnesium should be 
used to manufacture 1000 widgets at the lowest possible cost? (Hint: Find an extreme value 
for what function subject to what constraint?) 
24. Let f: R > R be of class C' and let 

u = f(x) 

v = —y + xf (x). 


If f (xo) Æ 0, show that this transformation of R? to R? is invertible near (xo, yo) and its 
inverse is given by 


x= fu) 
y= —vt+uf"(u). 
25. Show that the pair of equations 


vP-y-wtv+4+4=0 
2xy +y? — 2u? 3v 8-0 


determine functions u(x, y) and v(x, y) defined for (x, y) near x = 2 and y = —1 such that 
u(2, —1) = 2 and v(2, —1) = 1. Compute du/dx at (2, —1). 


26. Show that there are positive numbers p and q and unique functions u and v from the 
interval (—1 — p, —1 + p) into the interval (1 — q, 1 + q) satisfying 


xe") + u(xje = 0 = xe + v(x)e" 
for all x in the interval (C1 — p, —1 + p) with u(—1) = 1 = v(—1). 


27. To work this exercise, the reader should be familiar with the technique of diagonalizing a 
2 x 2 matrix. Let a(x), b(x), and c(x) be three continuous functions defined on U U aU, 
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where U is an open set and 8U denotes its set of boundary points (see Section 2.2). Use the 
notation of Lemma 2 in Section 3.3, and assume that for each x € U U 3U the quadratic form 
defined by the matrix 

a b 

be 


is positive-definite. For a C? function v on U U aU, we define a differential operator L by 
Lv = a(9^v/àx?) + 2b(8?v/dxdy) + c(8?v/8y?). With this positive-definite condition, such 
an operator is said to be elliptic. A function v is said to be strictly subharmonic relative to L 
if Lv > 0. Show that a strictly subharmonic function cannot have a maximum point in U. 


28. A function v is said to be in the kernel of the operator L described in Exercise 27 if 
Lv = 0on U U ðU. Arguing as in Exercise 37 of Section 3.3, show that if v achieves its 
maximum on U it also achieves it on ðU. This is called the weak maximum principle for 
elliptic operators. 


29. Let L bean elliptic differential operator as in Exercises 27 and 28. 


(a) Define the notion of a strict superharmonic function. 

(b) Show that such functions cannot achieve a minimum on U. 

(c) If v is as in Exercise 28, show that if v achieves its minimum on U it also achieves it 
on QU. 


The following method of least squares should be applied to Exercises 30 to 35. 


It sometimes happens that the theory behind an experiment indicates that the 
experimental data should lie approximately along a straight line of the form y = mx + b. The 
actual results, of course, never match the theory exactly. We are then faced with the problem 
of finding the straight line that best fits some set of experimental data (xi, yi), . .., (Xn, Yn) as 
in Figure 3.R.5. If we guess at a straight line y = mx + b to fit the data, each point will 
deviate vertically from the line by an amount d; = y; — (mx; + b). 


05,3) 


Gy) 
Figure 3.R.5 The method of least squares 
tries to find a straight line that best 
approximates a set of data. 


[5232] 


We would like to choose m and b in such a way as to make the total effect of these 
deviations as small as possible. However, because some are negative and some positive, we 
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could get a lot of cancellations and still have a pretty bad fit. This leads us to suspect that a 
better measure of the total error might be the sum of the squares of these deviations. Thus, we 
are led to the problem of finding the m and b that minimize the function 


s = f(m, b) d 1 d+- t d Yor — mx; — by, 
i-l 


where xi, ..., x, and yi, ..., y, are the given data. 


30. For each set of three data points, plot the points, write down the function f(m, b) from 
the preceding equation, find m and b to give the best straight-line fit according to the method 
of least squares, and plot the straight line. 


(a) Q3, 1) 7 (1, T) (b) G1, 1) = (0,0) 
(x2, ¥2) = (2, 3) (32, 2) = (1,2) 
(3, y3) = (4, 3) (x3, y3) = (2, 3) 


31. Show that if only two data points (x, yı) and (x2, y2) are given, this method produces the 
line through (xi, yi) and (x2, y2). 


32. Show that the equations for a critical point, ds/0b = 0 and ds/dm = 0, are equivalent to 


M(E) t= (X) ad (X) (33) = (Da) 


where all the sums run from i = 1 toi =n. 


33. If y = mx +b is the best-fitting straight line to the data points (xi, 1), ..., (Xn, Yn) 
according to the least-square method, show that 


3o — mx; — b) = 0; 
i=l 


that is, the positive and negative deviations cancel (see Exercise 32). 
34. Use the second derivative test to show that the critical point of f is a minimum. 


35. Use the method of least squares to find the straight line that best fits the points 
(0, 1), (1, 3), (2, 2), (3, 4), and (4, 5). Plot the points and line." 


17The method of least squares may be varied and generalized in a number of ways. The basic idea can be applied to 
equations of more complicated curves than the straight line. For example, this might be done to find the parabola that best 
fits a given set of data points. These ideas also formed part of the basis for the development of the science of cybernetics 
by Norbert Wiener. Another version of the data is the following problem of least-square approximation: Given a function 
J defined and integrable on an interval [a, b], find a polynomial P of degree <n such that the mean square error 


b 
[ ve reas 


is as small as possible. 
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... who by vigor of mind almost divine, the motions and figures of the 
planets, the paths of comets, and the tides of the seas first demonstrated. 


-Nawton's Epitaph 


hapters 2 and 3 focused on real-valued functions. This chapter is largely con- 

cerned with vector-valued functions. We begin in the first section with a con- 
tinuation of our study of paths, adding applications of Newton's second law. Then 
we study arc length of paths. Following this, we introduce the divergence and curl 
of a vector field which, in addition to the gradient, are basic operations in vector 
differential calculus. The basic geometry and calculus of the divergence and curl are 
studied. The associated integral calculus will be given in Chapter 8. 


4.1 Acceleration and Newton's Second Law 


In Section 2.4, we studied the basic geometry of paths, learning how to sketch curves 
(the images of paths) and compute tangent lines. We also learned to think of, as the 
name suggests, a path as the trajectory of a particle and to regard the derivative of the 
path as its velocity vector. In this section, we continue our study of paths, including 
additional topics, especially acceleration and Newton's second law. 


Differentiation of Paths 


Recall that a path in R” is a map c of R or an interval in R to R”. If the path is differen- 
tiable, its derivative at each time t is an n x | matrix. Specifically, if xi(£), . . . , Xn (£) 
are the component functions of c, the derivative matrix is 


dxı/dt 

dxı/dt 
e(n- . s 

dx, /dt 
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which can also be written in vector form as 
(dxi/dt, ..., dx, /dt) or as (x1(t), ...,.x,(t)). 


Recall from Section 2.4 that c'(t) is the tangent vector to the path at the point 
c(t). Also recall that if c represents the path of a moving particle, then its velocity 
vector is 

v=c'(t), 


and its speed is s = ||v||. 
The differentiation of paths is facilitated by the following rules. 


Differentiation Rules Let b(f) and c(t) be differentiable paths in IR? and p(t) 
and q(t) be differentiable scalar functions: 


Sum Rule: z b(t) + c(t)] = b'(t) + c(t) 


Scalar Multiplication Rule: | —[p(t)e(t)] = p'(t)e(t) + p(t)e (t) 


Dot Product Rule: lbo -c(t)] = b'(t) - e(t) + b(t) - c(t) 


Cross Product Rule: LO x e(t)] = b'(t) x e(t) + b(t) x e'(r) 


Chain Rule: “teal = q'(t)e'(q(t). 


These rules follow by applying the usual differentiation rules to the components. 


IJO Show that if c(/) is a vector function such that ||c(/)|| is constant, 
then c'(r) is perpendicular to c(f) for all t. 


SOLUTION Because ||c(£)|| is constant, so is its square ||c(7)]? = c(t) - c(t). The 
derivative of this constant is zero, so by the dot product rule, 


d 
Oe S eee = e'(t)- c(t) + e(t) - (0) = 2e(t)- e'(0; 
thus, e(z)» e'(7) = 0; that is, e’(f) is perpendicular to c(t). A 
For a path describing uniform rectilinear motion, the velocity vector is constant. In 


general, the velocity vector is a vector function v = c'(f) that depends on ż. The 
derivative a = dv/dt = c"(f) is called the acceleration of the curve. If the curve is 
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(x(t), y(t), 2(t)), then the acceleration at time ¢ is given by 


a(t) = x"(t)i + y" (t)j + z" (Ok. 


2 es) A particle moves in such a way that its acceleration is constantly 
equal to —k. If the position when ¢ = 0 is (0, 0, 1) and the velocity at / = 0 isi + j, 
when and where does the particle fall below the plane z — 0? Describe the path 
traveled by the particle (assume t > 0). 


SOLUTION Let (x(t), y(t), z(t)) be the path traced out by the particle, so that 
the velocity vector is e'(t) = x'(t)i + y'(t)j + z'(t)k. The acceleration e”(t) is —k, 
so x"(t) = 0, y"(t) = 0, and z"(t) = —1. It follows that x'(t) and y'(t) are constant 
functions, and z'(f) is a linear function with slope —1. Because c'(0) = i +j, we 
get c'(t) =i +j — tk. Integrating again and using the initial position (0, 0, 1), we 
find that (x(t), y(t), z(t)) = (t, t, 1 — in) The particle drops below the plane z — 0 
when | — ie = 0; that is, £ = /2 (because t > 0). At that instant, the position is 
(V2, V2, 0). The path traveled by the particle is a parabola in the plane y = x (see 
Figure 4.1.1), because in this plane the equation is described by z = 1 — jx. A 


Figure 4.1.1 The path of the particle with initial position 
v(0) (0, 0, 1), initial velocity i + j, and constant acceleration —k 


my is a parabola in the plane y = x. 


(42,42, 0) 


The image ofa C! path is not necessarily “very smooth”; indeed, it may have sharp 
bends or changes of direction. For instance, the cycloid c(t) = (t — sint, 1 — cos t) 
shown in Figure 2.4.6 has cusps at all points where c touches the x axis (that is, when 
1 — cost = 0, which happens when ( = 2ztz, n = 0, +1, .. .). Another example is the 
hypocycloid of four cusps, c: [0, 27] — R?, t (cos? t, sin? t), which has cusps at 
four points (Figure 4.1.2). At all such points, however, c'(^) = 0, and the tangent line 
is not well defined. Evidently, the direction of c'(f) may change abruptly at points 
where it slows to rest. 

A differentiable path c is said to be regular at t = to if e'(to) # 0. If c'(t) Æ 0 for 
all t, we say that c is a regular path. In this case, the image curve looks smooth. 


[EXAMPLE 3 A particle moves along a hypocycloid according to the equations 
x= cot, y=sint, a<t<b. 


What are the velocity and speed of the particle? 
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Image of c 


gm * — *|ook smooth.” 


SOLUTION The velocity vector of the particle is 


[20] Figure 4.1.2 The image of the smooth path 
e(t) = (cos? t, sin? t), a hypocycloid, does not 


= Ziy 2j = —(3 sint cos? t)i + (3 cost sir? t)j, 


and its speed is 


s = ||vl| = (9 sin? t cos^ t +9 cos? t sin* t)? = 3|sint| |cost|. a 


Newton's Second Law 


If a particle of mass m moves in IR?, the force F acting on it at the point c() is related 
to the acceleration a(t) by Newton’s second law:! 


F(e(t)) = ma(t). 


In particular, if no forces act on a particle, then a(t) = 0, so e’(t) is constant and the 
particle follows a straight line. 


Acceleration and Newton's Second Law The acceleration of a path c(t) is 


a(t) = e"(t). 


If F is the force acting and m is the mass of the particle, then 


F = ma. 


! Most scientists acknowledge that F = ma is the single most important equation in all of science and engineering. 
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In the problem of determining the path e(ż) of a particle under the influence of a 
given force field, F, Newton's law becomes a differential equation (i.e., an equation 
involving derivatives) for c(t). 

For example, the motion of a planet moving along a path r(¢) around the sun 
(considered to be located at the origin in IR?) obeys the law 

5 GmM 


mr = Y, 
r? 


where M is the mass of the sun, m that of the planet, r = ||r||, and G is the grav- 
itational constant. The relation used in determining the force, F = — Gm Mr/r?, is 
called Newton's law of gravitation (see Figure 4.1.3). We shall not make a general 
study of such equations in this book, but content ourselves with the special case of cir- 
cular orbits. (More general orbits—the conic sections—are discussed in the Internet 
supplement.) 


Figure 4.1.3 A mass M attracts a mass m witha 
force F given by Newton’s law of gravitation: 
F = —GmMr/r?. 


Circular Orbits 


Consider a particle of mass m moving at constant speed s in a circular path of radius 
ro. Supposing that it moves in the xy plane, we can suppress the third component and 
write its location as 

st ASE 

r(t) = | ro cos —, ro sin — J. 

ro ro 
Note that this is a circle of radius ro and that its speed is given by ||r’(t)|| = s. The 
quantity s/ro is called the frequency and is denoted w. Thus, 


r(t) = (ro cos ot, rosin ct). 
The acceleration is given by 
2 sts? RS. 


H s : ge 2 
a(t) 2r'(nz- ( cos —, sin ) i. r(t) w r(t). 
0 


ro ro ro ro 
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Thus, the acceleration is in a direction opposite to r(t); that is, it is directed toward the 
center of the circle (see Figure 4.1.4). This acceleration multiplied by the mass of the 
particle is called the centripetal force. Even though the speed is constant, the direction 
of the velocity is continuously changing and therefore the acceleration, which is a 
rate of change in either speed or direction or both, is nonzero. 


r()-v 


EO] Figure 4.1.4 The position, velocity, and acceleration of a 
a(t) - particle in circular motion. 


Newton's law helps us discover a relationship between the radius of the orbit of a 
revolving body and the period, that is, the time it takes for one complete revolution. 
Consider a satellite of mass m moving with a speed s around a central body with mass 
M in a circular orbit of radius rọ (distance from the center of the spherical central 
body). By Newton’s second law F = ma, we get 

2 

Eg a PR 
ro rò 


The lengths of the vectors on both sides of this equation must be equal. Hence, 
ha GM 
ro 


Ss 


If T denotes the period, then s = 27rro/ T; substituting this value for s in the preceding 
equation and solving for 7, we obtain the following: 


Kepler’s Law 


P= 3 (2x) 


=T GM ` 


Thus, the square of the period is proportional to the cube of the radius. 


We have defined two basic concepts associated with a path; its velocity and its accel- 
eration. Both involve differential calculus. The basic concept of the length of a path, 
which involves integral calculus, will be taken up in the next section. 


Suppose that a satellite is to be in a circular orbit about the earth 
such that it stays fixed in the sky over one point on the equator. What is the radius of 


4.1 Acceleration and Newton's Second Law 267 


such a geosynchronous orbit? (The mass of the earth is 5.98 x 107^ kilograms and 
G = 6.67 x 107! in the meter-kilogram-second system of units.) 


SOLUTION The period ofthe satellite should be 1 day, so 7 — 60 x 60 x 24 — 
86,400 seconds. From the formula 7? = ro(2z ^ / GM, we get rò = T?GM/Qny,, 
and so 


3 T?GM (86,400) x (6.67 x 1071!) x (5.98 x 10%) 
n= xy — Qr’ 


= 7.54 x 10? m°. 


Thus, ro = 4.23 x 107 m = 42,300 km 7 26,200 mi. A 


Supplement to Section 4.1: Planetary Orbits, Hamilton’s Principle, 
and Spacecraft Trajectories 

In this section, we have been studying paths in space and Newton’s second law. 
Hopefully, the student realizes that these ideas apply to the real world—the motion of 


our earth around the sun, for example, is governed by these laws. But there is more 
to the story, and we will try to convey some of it here. 


aa Peal eee 


Kepler, Newton, and Hamilton 


As we discussed in the historical introduction, the law of planetary motion 
stating that the square of the period is proportional to the cube of the radius 
of an orbit is one of the three that Kepler observed before Newton formulated 
his laws of motion, known more generally as Newton’smechanics. These 
mechanics enable one to compute the period of a satellite about the earth or a 
planet about the sun (when the radius of its orbit is given), and, as we will 
indicate shortly, trajectories of space missions. 

Kepler discovered and used results like this not only for circular orbits 
but more generally for elliptical orbits. Newton was able to derive Kepler’s 
three celestial laws from his own law of gravitation. The neat mathematical 
order of the universe that these laws provided had a great impact on 
eighteenth-century thought. 

Newton never wrote down his laws of mechanics as differential 
equations. This was first done by Euler around 1730. Newton made most of 
his deductions (at least those in published form) by geometric methods. 
Euler also showed how Newton’s equations followed from Maupertuis’s 
action principle. The clearest version of the action principle in mechanics, 
now known as Hamilton’s principle, is due to William Rowan Hamilton 
around 1830, who, as we all should now know, happens to also be the father 
of vector calculus. Hamilton’s version of Maupertuis’s principle was 
elegantly presented by Richard Feynman, as we discuss next. 
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Feynman and Hamilton's Principle 


In his legendary Caltech Lectures on Physics, Nobel Prize-winning physicist Richard 
Phillips Feynman (see Figure 4.1.5) included what he called a “Special Lecture” on a 
topic clearly very close to his heart—one that he first heard about from his New York 
high school teacher, Mr. Bader. Mr. Bader told his (apparently bored) student Feynman 
how principles of maxima and minima apply to the trajectories of moving objects and 
in particular how the action principle of Maupertuis, Leibniz, and Hamilton (discussed 
in Section 3.3) applies to Newton's mechanics, governed by F — ma. 


Figure 4.1.5 Richard P. Feynman (1918-1988). 


Professor Feynman, at the end of his lecture, notes that “a physicist, a student 
of Mr. Bader, in 1942 showed how this action principle applied to quantum me- 
chanics.” That student was Feynman himself, who received the Nobel Prize for his 
insights, which also included the discovery of Feynman integrals. The moral here is 
pay attention to your teachers—especially the best ones! 

We include the first part of Feynman’s lecture here and more of it in the Internet 
supplement; see Volume II, Lecture 19, of the Feynman Lectures on Physics for the 
entire lecture. 


The Principle of Least Action, by Richard Feynman 


When I was in high school, my physics teacher—whose name was Mr. 
Bader—called me down one day after physics class and said, “You look 
bored; I want to tell you something interesting." Then he told me 
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Figure 4.1.6 Feynman lecturing at Caltech. 


something which I found absolutely fascinating, and have, since then, 
always found fascinating. Every time the subject comes up, I work on it. In 
fact, when I began to prepare this lecture I found myself making more 
analyses on the thing. Instead of worrying about the lecture, I got involved 
in a new problem. The subject is this—the principle of least action. 

Mr. Bader told me the following: Suppose you have a particle (in a 
gravitational field, for instance) which starts somewhere and moves to 
some other point by free motion—you throw it, and it goes up and comes 
down [see Figure 4.1.7]. 


There 


Figure 4.1.7 


Here 


It goes from the original place to the final place in a certain amount of 
time. Now, you try a different motion. Suppose that to get from here to 
there, it went like this [see Figure 4.1.8], but got there in just the same 
amount of time. 
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Imagined 
motion 


Figure 4.1.8 


Here 


ti 


Then he said this: “If you calculate the kinetic energy at every moment 
on the path, take away the potential energy, and integrate it over the time 
during the whole path, you'll find that the number you'll get is bigger than 
that for the actual motion." 

In other words, the laws of Newton could be stated not in the form 
F — ma but in the form: The average kinetic energy less the average 
potential energy is as little as possible for the path of an object going from 
one point to another. 

Let me illustrate a little better what this means. If you take the case of 
the gravitational field, then if the particle has the path x(t) (let's just take 
one dimension for a moment; we take a trajectory that goes up and down 
and not sideways), where x is the height above the ground, the kinetic 
energy is }m(dx/dt), and the potential energy at any time is mgx. Now I 
take the kinetic energy minus the potential energy at every moment along 
the path and integrate that with respect to time from the initial time to the 
final time. Let's suppose that at the original time t; we started at some 
height and at the end of the time t, we are definitely ending at some other 
place [see Figure 4.1.9]. 


Figure 4.1.9 
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Then the integral is 


The actual motion is some kind of curve—it's a parabola if we plot against 
the time—and gives a certain value for the integral. But we could imagine 
some other motion that went very high and came up and down in some 
peculiar way [see Figure 4.1.10]. 


Figure 4.1.10 


We can calculate the kinetic energy minus the potential energy and 
integrate for such a path . . . or for any other path we want. The miracle is 
that the true path is the one for which that integral is least. 


Real-Life Trajectories 


Interesting paths in IR? that obey Newton's second law occur in our own solar system 
and are used by NASA to plan space missions. One such mission, the Genesis Dis- 
covery Mission, launched from earth August 8, 2001 (and is due to return to earth in 
September 2004), has a particularly interesting trajectory, as shown in Figure 4.1.11. 
More information about this trajectory and the mission objectives can be found at 
http://genesismission.jpl.nasa.gov/. 

The points denoted L; and L3 in this figure denote places of balance (discovered 
by Euler) between the earth and the sun. A motionless spacecraft positioned there 
will remain there. There are periodic orbits about these points that we have (loosely) 
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Moon's Return 
orbit trajectory 


Transfer Earth 
to halo 


x 


AOS 


500,000 


Figure 4.1.11 Trajectory of the Genesis spacecraft from the earth to a periodic 
orbit about a million and a half kilometers from earth and the interesting return 
trajectory to earth. 


called halo orbits. The main dynamics of the spacecraft is governed by the pull of both 
the earth and the sun (and to a very small extent the moon) on the spacecraft. This 
is thus part of the famous three-body problem studied and made famous by Poincaré 
around 1890.2 


Emmy Noether and Hamilton's Principle 


Emmy Noether (1882-1935) (see Figure 4.1.12) is perhaps best known for her 
work in algebra, but she made a significant contribution to Hamilton's principle as 
well.? For planetary motion, the angular momentum vector J = r(t) x m(t) is time- 
independent (so is a conserved quantity), as one can readily see by computing the time 
derivative of J and using F = ma (see Exercise 20). What Noether discovered was 
a deep connection between such conserved quantities and symmetries in Hamilton’s 
principle—in the case of angular momentum, this is rotational symmetry. Noether's 
discoveries have had a profound influence on the study of mechanical systems, from 
classical to quantum, ever since. 


?For more information about Poincaré, see F. Diacu and P. Holmes, Celestial Encounters. The Origins of Chaos and 
Stability, Princeton University Press: Princeton, NJ, 1996. 


3“Invariante Variationsprobleme,” Göttingen Math. Phys. 2 (1918): 235-257. 
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Figure 4.1.12 Emmy Noether (1882-1935). 


In Exercises 1 to 4, find the velocity and acceleration vectors and the equation of the tangent 


line for each of the following curves, at the given value of t. 
1. r(t) = (cos ¢)i + (sin21)j, att = 0 3. r(t) = J2ti + 


2. e(t) — (t sint, t cost, 3t), att = 0 4. e(r) 5 ti  tjd 


e'jd- ek, att —0 


+ 217k, att — 9 


In Exercises 5 to 8, let e(t) = e'i + (sin t)j + Pk and e(t) = e^'i4 


+ (cost)j — 2? k. Find 


each of the stated derivatives in two different ways to verify the rules in the box preceding 


Example 1. 


5. Se + ext)] 

6. Zee] 
d 

7. aq; le x ex(t)] 


d 
8. qo: Dex) + ex(t)]} 


9. Ifr(t) = 6ti + 3t?j + Pk, what force acts on a particle of mass m moving along r at t = 0? 
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10. Leta particle of mass 1 gram (g) follow the path in Exercise 1, with units in seconds and 
centimeters. What force acts on it at £ = 0? (Give the units in your answer.) 


11. A body of mass 2 kilograms moves on a circle of radius 3 meters, making one 
revolution every 5 seconds. Find the centripetal force acting on the body. 


12. Find the centripetal force acting on a body of mass 4 kilograms, moving on a circle of radius 
10 meters with a frequency of 2 revolutions per second. 


13. Show that if the acceleration of an object is always perpendicular to the velocity, then the 
speed of the object is constant. (HNT: See Example 1.) 


14. Show that, at a local maximum or minimum of ||r(¢)|], the vector r'(t) is perpendicular 
to r(t). 


15. A satellite is in a circular orbit 500 miles above the surface of the earth. What is the period 
of the orbit? (You may take the radius of the earth to be 4000 miles, or 6.436 x 10° meters). 


16. What is the acceleration of the satellite in Exercise 15? The centripetal force? 
17. Find the path c such that c(0) = (0, —5, 1) and e’(t) = (t, ef, t°). 
18. Let ¢ be a path in R? with zero acceleration. Prove that c is a straight line or a point. 


19. Find paths e(t) that represent the following curves or trajectories. 


(a {(@yly=e} (c) A straight line in IR? passing through 
(b) (9,5) | 4? + y? = 1] the origin and the point (a, b, c) 
(d) f(x, y) | 9x? + 165? = 4} 


20. Let c(f) be a path, v(t) its velocity, and a(t) the acceleration. Suppose F is a C! mapping 
of R? to IR?, m > 0, and F(e(t)) = ma(t) (Newton's second law). Prove that 


meto) x WO] = e(t x Fito) 


(i.e., “rate of change of angular momentum = torque"). What can you conclude if F(e(t)) is 
parallel to ¢(t)? Is this the case in planetary motion? 


21. Continue the investigations in Exercise 20 to prove Kepler’s law that a planet moving 
under the influence of gravity about the sun does so in a fixed plane. 


4.2 Arc Length 
Definition of Arc Length 


What is the length of a path c(t)? Because the speed ||c’(¢)|| is the rate of change of 
distance traveled with respect to time, the distance traveled by a point moving along 
the curve should be the integral of speed with respect to the time over the interval 
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[to, tı] of travel time; that is, the length of the path, also called its arc length, is 


ty 
Le) [vena 
fo 

There is the question as to whether or not this formula actually corresponds to 
the true arc length. For example, suppose we take a curve in space and glue a string 
tightly to it, cutting the string so it exactly fits the curve. If we then remove the string, 
straighten it out and measure it with a straight edge, we surely should obtain the length 
of the curve. That our formula for arc length agrees with such a process is justified 
in the supplement at the end of this section. 


VABA The arc length of the path e(t) = (r cost,r sint), for t lying in 
the interval [0, 27]; that is, for 0 < t < 27, is 


2n 
L(c) = V (7r sint)? + (r cost)? dt = 27r, 
0 


which is the circumference of a circle of radius r. If we had allowed 0 < t < 477, we 
would have obtained 47r, because the path traverses the same circle twice (Figure 
42.) A 


Figure 4.2.1 The arc length of a circle traversed twice 
is 4zr. 


Arc Length The length of the path e(t) = (x(t), y(t), z(t)) for to < t < tı, is 


Le) = [Vir OP + OF EOP dr. 
to 


For planar curves, one omits the z'(f) term, as in Example 1. Here is an example 
in R°. 


IS C aioe) Find the arc length of (cost, sint, t?),0 < t < m. 


SOLUTION The path e(t) = (cost, sint, t?) has the velocity vector given by 
v = (—sint, cost, 2t). Because 


12 
Ivii = V sin? t+ cos? t - 4? = 14-4 =240+ (5) : 
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the arc length is 


Ji 2 
1o [ we « (5) dt. 


This integral may be evaluated using the following formula from the table of integrals: 


[ve +a? dx = jeve +a? +a’ log(x + Vx? +a?)] +C. 


Thus, 
Le =2-1 t/t? + oe iYi t+ P+ n 
=en 2 a) ^ 2 
t=0 
m Arit ig m+ gtd zig E 
y" 734734 8 y" Ta) 4 ya 


m 
=vl 


1 
+472 + 4 log (2t + V1 + 412) ~ 10.63. 


N 


As a check on our answer, we may note that the path c connects the points (1, 0, 0) 
and (—1, 0, z?). The distance between these points is V4 + m? ~ 3.72, which is less 
than 10.63, as it should be. A 


If a curve is made up of a finite number of pieces each of which is C! (with 
bounded derivative), we compute the arc length by adding the lengths of the compo- 
nent pieces. Such curves are called piecewise C'. Sometimes we just say “piecewise 
smooth." 


A billiard ball on a pool table follows the path e: [—1, 1] > IR? 
defined by e(t) = (x(t), y(t), z(£)) = (Itl. |t — HR 0). Find the distance traveled by the 
ball. 
SOLUTION This path is not smooth, because x(t) = |t] is not differentiable at 
0, nor is y(t) = |t — | differentiable at 7. However, if we divide the interval [—1, 1] 
into the pieces [—1, 0], [0, 2], and [5, 1], we see that x(/) and y(t) have continuous 
derivatives on each of the intervals [—1, 0], [0, 3], and [}, 1]. (See Figure 4.2.2.) 
On [-1, 0], x(t) t, y(t) t4 i and z(t) = 0, so ||e’(t)|| = /2. Hence, 
the arc length of c between — 1 and 0 is sft J/2dt = A2. Similarly, on [0, il x(t) 2 t, 
y(t) = —t + $, z(t) = 0, and again ||c'(£)|| = 4/2, so that the arc length of c between 0 
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Figure 4.2.2 A piecewise smooth path. 


and } is 14/2. Finally, on [}, 1] we have x(7) = t, y(t) = t — $, z(t) = 0, and the arc 
length of c between i and 1 is 1x2. Thus, the total arc length of c is 24/2. Of course, 


one can also compute the answer as the sum of the distances from c(— 1) to c(0) to c( 3) 
toc(l) A 


|WIN) Consider the point with position function 


c(t) = (t — sint, 1 — cost), 


which traces out the cycloid discussed in Section 2.4 (see Figure 2.4.6). Find the 
velocity, the speed, and the length of one arch. 


SOLUTION The velocity vector is e'(t) = (1 — cost, sint), so the speed of the 
point c(t) is 


lle (0| = V(1— cost)? + si t = 42 —2 cost. 


Hence, c(t) moves at variable speed although the circle rolls at constant speed. Fur- 
thermore, the speed of c(t) is zero when t is an integral multiple of 27r. At these values 
of t, the y coordinate of the point c(t) is zero and so the point lies on the x axis. The 
arc length of one cycle is 


n 72. [1— cost 
msl VI=Teostdt | — i 
0 0 


eet t t 
= if sin 2 dt (because 1 — cost = 2 sin’ 2 and sin 2 > 0 on [0, 2m1) 
0 
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The Differential of Arc Length 


The arc-length formula suggests that one introduce the following notation, which will 
be useful in Chapter 7 in our discussion of line integrals. 


Arc-Length Differential An infinitesimal displacement of a particle follow- 
ing a path c(t) = x(t)i + y(t)j + z(t)k is 


dy, dz 
ds = dxi + dyj + dzk = | —i+ —j dt, 
s xi + dyj + dz (22 I+ a k) 
and its length 


dx\? dy 2 dz WV? 
ds =. fax? xd +dz = 
s dx? + dy? + dz rE +(2) «($) dt 


is the differential of arc length. See Figure 4.2.3. 


ex" ds = lids || = V dx d dz? 


x 


Figure 4.2.3 Differential of arc length. 


These formulas help us remember the arc-length formula as 
t 
arc length — f ds. 
to 


As we have done before with such geometric concepts as length and angle, we can 
extend the notion of arc length to paths in n-dimensional space. 
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Arc Length in IR" Let e: [f, tj] + R” be a piecewise C! path. Its length is 
defined to be 


Ln 
1e) = f Ora 
to 
The integrand is the square root of the sum of the squares of the coordinate 
functions of c'(1): If 
e(t) = (x1 (t), x2(t), - - - , Xn(t)), 


then 


Lo | VAO + OOP FF eua. 


Find the length of the path e(t) = (cost, sint, cos2t, sin 2t) 
in IR^, defined on the interval from 0 to zr. 


SOLUTION We have c'(f) = (—sint, cost, —2 sin2t, 2 cos2t), and so 


lle (DI = vV sin? t + cos? t +4 sin? 2t +4 cos? 2t = V1 +4 = V5, 
a constant, so the length of the path is 


[ 5a - 8. A 


It is common to introduce the arc-length function s(t) associated to a path e(t) 
given by 


t 
sO= f 1eGot du, 
so that (by the fundamental theorem of calculus) 


sO = Ile Cl 


and 
b 
f s'(t)dt = s(b) — s(a) = s(b). 
a 
Da Consider the graph of a function of one variable y = f(x) for x 


in the interval [a, b]. We can consider it to be a curve parametrized by t = x, namely, 
e(x) = (x, f(x)) for x ranging from a to b. The arc-length formula gives 


b 
L9- f VIFO dx, 


which agrees with the formula for the length ofa graph from one-variable calculus. A 
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Justification for the Arc-Length Formula 


The following discussion assumes an acquaintance with the definite integral defined 
in terms of Riemann sums. If your background in this topic needs reinforcement, the 
material may be postponed until after Chapter 5. 

In IR? there is another way to justify the arc-length formula based on polygonal 
approximations. We partition the interval [a, b] into N subintervals of equal length: 


a=b cti «c: «ty —b; 


b—a 


figi—t; = for O0xizN-Il. 


We then consider the polygonal line obtained by joining the successive pairs of 
points c(f;), e(t;41) for0 < i < N — 1. This yields a polygonal approximation to c as 
in Figure 4.2.4. By the formula for distance in IR?, it follows that the line segment 
from c(;) to e(¢;41) has length 


lieti) — e) = Vien) — xP + DG) — »G)P + Eta) — EP, 


where c(t) = (x(t), y(t), z(t)). Applying the mean-value theorem to x(t), y(t), and 
z(t) on [t;, 4:41], we obtain three points tř, t}*, and £^* such that 


x(ti) — x(t) = x (ta — ti), 
(tii) — y(t) = YG VG — ti), 


c(fy) = C(b) 


a= fj fj f ty zb 


Figure 4.2.4 A path c may be approximated by a polygonal path obtained by 
joining each c(f;) to c(f;.,.1) by a straight line. 
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and 
2(ti41) — Z(t) = z (yt — ti). 


Thus, the line segment from c(f;) to c(t;,,) has length 


Vx GP + DEP + E" GP Ga — hi). 


Therefore, the length of our approximating polygonal line is 


N-1 
Sy = XO Vr GP + GPP Ema — t) 
i=0 


As N — ox, this polygonal line approximates the image of c more closely. There- 
fore, we define the arc length of c as the limit, if it exists, of the sequence Sy as 
N — oo. Because the derivatives x’, y’, and z’ are all assumed to be continuous on 
[a, b], we can conclude that, in fact, the limit does exist and is given by 


b 
limit $y = f VOR + DOF +OP at. 


(The theory of integration relates the integral to sums by the formula 
b N-I 
f f(t)dt = limit Y ^ Stia — ti), 
a RET 


where fo, ... , ty isapartition of [a, b], t? € [f ti+ı]is arbitrary, and f is a continuous 
function. Here we have possibly different points t7, t^, and t**, and so this formula 
must be extended slightly.) 


EXERCISES 

Find the arc length of the given curve on the specified interval in Exercises 1 to 6.4 
1. (2 cost, 2 sint, t), for0 < t < 2x 
2. (1,307, 03), for0 €t <1 


3. (sin3¢, cos35, 2/7), for0 < t < 1 


^Several of these problems make use of the formula 
[v2 +a dx = hve +a? +a" log(x + Vx? +a)| +C 


from the table of integrals in the back of the book. 
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242 1 
4. (emen) e 5. (t,t,2),forl «t2 


6. (t,t sint, t cost), for0 <t € x 


7. Find the length of the path e(t), defined by e(t) = (2 cost, 2 sint, t), if 0 < £ < 2z and 
c(t) = (2, t — 27, t), if2z < t x 4n. 


8. Let c be the path e(t) = (t, t sint, t cost). Find the arc length of c between the two 
points (0, 0, 0) and (zr, 0, — ). 


9. Let c be the path e(t) = (2t, t°, log t), defined for t > 0. Find the arc length of ¢ between 
the points (2, 1, 0) and (4, 4, log 2). 


10. The arc-length function s(t) for a given path e(t), defined by s(t) = T lle(x)|| dt, 
represents the distance a particle traversing the trajectory of e will have traveled by time / 
if it starts out at time a; that is, it gives the length of c between c(a) and e(t). Find the 
arc-length functions for the curves a(t) = (cosh f, sinh t, t) and A(t) = (cos t, sint. t), 
with a = 0. 


11. Let c(r) be a given path, a < t < b. Let s = a(t) be a new variable, where o is a strictly 
increasing C! function given on [a, b]. For each s in [o(a). œ(b)] there is a unique ¢ with 
a(t) = s. Define the function d: [o(a), a(b)] — R? by d(s) = e(t). 


(a) Argue that the image curves of c and d are the same. 
(b) Show that c and d have the same arc length. 
(c) Lets = a(t) = qr lle'(t)|| dt. Define d as above by d(s) = e(t). Show that 


d 
[aco =1. 


The path s ++ d(s) is said to be an arc length reparametrization of c (see also Exercise 13). 


Exercises 12 to 17 develop some of the classic differential geometry of curves. 


12. Let e: [a, b] — R? be an infinitely differentiable path (derivatives of all orders exist). 
Assume e’(t) # 0 for any t. The vector e’(t)/||e’(t)|| = T(t) is tangent to c at e(ż), and, 
because || T(/)|| = 1, T is called the unit tangent to c. 


(a) Show that T’(¢) - T(t) = 0. [Hint: Differentiate T(t) - T(t) = 1.] 
(b) Write down a formula for T'(1) in terms of c. 


13. (a) A path c(s) is said to be parametrized by arc length or, what is the same thing, to 
have unit speed if ||c'(s)|| = 1. For a path parametrized by arc length on [a, b], show that 
l(c) 2 b—a. 

(b) The curvature at a point ce(s) on a path is defined by k = |[T'(s)|| when the path is 
parametrized by arc length. Show that k = ||e"(s)]|. 

(c) Ifc is given in terms of some other parameter ¢ and c'(r) is never 0, show that 


k = feto x ONON. 
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(d) Calculate the curvature of the helix e(t) = (1/4/2)(cost, sint, t). (This c is a scalar 
multiple of the right-circular helix.) 


14. If T(t) 4 0, it follows from Exercise 12 that N(t) = T'(r)/|| T'(7)|| is normal (i.e., 
perpendicular) to T(r); N is called the principal normal vector. Let a third unit vector that is 
perpendicular to both T and N be defined by B = T x N; B is called the binormal vector. 
Together, T, N, and B form a right-handed system of mutually orthogonal vectors that may 
be thought of as moving along the path (Figure 4.2.5). Show that 


dB 
(a) FA B=0. (c) dB/dt is a scalar multiple of N. 
dB 
—-T=0. 
6 a 
a 
e(t) 
CN 


SW Figure 4.2.5 The tangent T, principal normal N, and 


binormal B. 
nd 


15. If ce(s) is parametrized by arc length, we use the result of Exercise 14(c) to define a 
scalar-valued function r, called the torsion, by 
dB 
— =-tN. 
ds B 
(a) Show that t = [e'(s) x e"(s)] - e"(s)/lle "(s)". 
(b) Show that if c is given in terms of some other parameter t, 


— [e (0 xe)" 
le) x eoD 


Compare with Exercise 13(c). 
(c) Compute the torsion of the helix e(t) = (1/2) cost, Sint, f). 


16. Show that if a path lies in a plane, then the torsion is zero. Do this by demonstrating that 
B is constant and is a normal vector to the plane in which c lies. (If the torsion is not zero, it 
gives a measure of how fast the curve is twisting out of the plane of T and N.) 


17. (a) Use the results of Exercises 13, 14, and 15 to prove the following Frenet formulas 
for a unit-speed curve: 
dT dN dB 


ag = kN; Pr —kT + tB; ds —tN. 
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(b) Reexpress the results of part (a) as 


for a suitable vector c». 


18. In special relativity, the proper time of a path c: [a, b] > IR^ with components given by 
€(A) = (x(A), yA), 2(A), t(A)) is defined to be the quantity 


b 
[ VKP — D'0P — OP + PAP da, 


where c is the velocity of light, a constant. In Figure 4.2.6, show that, using self-explanatory 
notation, the “twin paradox inequality" holds: 


proper time (AB) + proper time (BC) < proper time (AC). 


Figure 4.2.6 The relativistic triangle inequality. 


19. The early Greeks knew that a straight line was the shortest possible path between two 
points. Euclid, in his book Optics, stated the “principle of the reflection of light"—that is, 
light traveling in a plane travels in a straight line, and when it is reflected across a mirror, the 
angle of incidence equals the angle of reflection. 

The Greeks could not have had a proof that straight lines provided the shortest path 
between two points because they, in the first place, had no definition of the length of a path. 
They saw this property of straight lines as more or less *obvious." 

Using the justification of arc length in this section and the triangle inequality of Section 
1.5, argue that if co is the straight-line path co(¢) = tP + (1 — 1)Q between P and Q in R?, then 


L(eo) < L(c) 


for any other path c joining P and Q. 
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4.3 Vector Fields 
'The Concept of a Vector Field 


In Chapter2, we introduceda particular kind of vector field, the gradient. In this section 
we study general vector fields, discussing their geometric and physical significance. 


Vector Fields A vector field in IR" is a map F: A C R” — R” that assigns to 
each point x in its domain A a vector F(x). If n = 2, F is called a vector field in 


the plane, and if n — 3, F is a vector field in space. 


Picture F as attaching an arrow to each point (Figure 4.3.1). By contrast, a map 
f:A C R” — R that assigns a number to each point is a scalar field. A vector field 
F(x, y, z) on R? has three component scalar fields F, F), and F3, so that 


F(x, y, z) = (Fix, y, z), Fax, y, z), FQ. y, 2))- 


Similarly, a vector field on R” has n components F;, ..., Fa. If each component is a 
C* function, we say the vector field F is of class C*. Vector fields will be assumed to 
be at least of class C! unless otherwise noted. 


Figure 4.3.1 A vector field F assigns a vector F(x) to 
each point x of its domain. 


In many applications, F(x) represents a physical vector quantity (force, velocity, etc.) 
associated with the position x, as in the following examples. 


The flow of water through a pipe is said to be steady if, at each 
point inside the pipe, the velocity of the fluid passing through that point does not 
change with time. (Note that this is quite different from saying that the water in the 
pipe is not moving.) Attaching to each point the fluid velocity at that point, we obtain 
the velocity field V of the fluid (see Figure 4.3.2). Notice that the length of the arrows 
(the speed), as well as the direction of flow, may change from point to point. A 
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Figure 4.3.2 A vector field describing the 
velocity of flow in a pipe. 


[SEES] Some forms of rotary motion (such as the motion of particles on 
a turntable) can be described by the vector field 


V(x, y) = —yi+ xj. 


See Figure 4.3.3, in which we have shown instead of V the shorter vector field iV 
so that the arrows do not overlap. This is a common convention in drawing pictures 


of vector fields. A 


>% 


Figure 4.3.3 A rotary vector field. 


DOA G In the plane, R2, let the vector field x be defined by 


yi xj » x 
Vey) x+y? 235 - (aL aia) 


(except at the origin, where V is not defined). This vector field is a good approximation 
to the planar part of the velocity of water flowing toward a hole in the bottom of a tub 
(Figure 4.3.4). Notice that the velocity becomes larger as you approach the hole. A 


Gradient Vector Fields 
In Section 2.6 we introduced the gradient of a function by 
af f f 


.18 ..9 
VIG y z) = 4-6 y Dit a, rt ag; 0» DK 
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Figure 4.3.4 The vector field describing circular flow in a tub. 


Now we want to think of this as an example of a vector field—it assigns a vector to 
each point (x, y, z). As such, we refer to V f as a gradient vector field. Gradient fields 
come up in a variety of situations, as the next two examples show. 


E 


A piece of material is heated on one side and cooled on another. 
The temperature at each point within the body is described at a given moment by a 
scalar field T (x, y, z). The flow of heat may be marked by a field of arrows indicating 
the direction and magnitude ofthe flow (Figure 4.3.5). This energy or heat flux vector 
field is given by J = —kVT, where k > 0 is a constant called the conductivity and 
VT is the gradient of the real-valued function T. Level sets of T are called isotherms. 
Note that the heat flows from hot regions toward cold ones, since — VT points in the 
direction of decreasing T. A 


Figure 4.3.5 A vector field describing the 
direction and magnitude of heat flow. 


The force of attraction of the earth on a mass m can be described 
by a vector field called the gravitational force field. Place the origin of a coordinate 
system at the center of the earth (assumed spherical). According to Newton's law of 
gravity, this field is given by 


mMG 


r=- 


r, 
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where r(x, y, z) = (x, y, z), and r = ||r|| (see Figure 4.3.6). The domain of this 
vector field consists of those r for which ||r|| is greater than the radius of the earth. 
As we saw in Example 6, Section 2.6, F is a gradient field, F — —V V, where 


mMG 
r 


is the gravitational potential. Note again that F points in the direction of decreasing 
V. Writing F in terms of components, we see that 


—mMG  —mMG  —mMG 
F(x, y, z) p x, 3 a zl. 


r 


Figure 4.3.6 The vector field F given by Newton's 
law of gravitation. 


ISHS According to Coulomb's law, the force acting on a charge e at 


position r due to a charge Q at the origin is 


where V = eQe/r and e is a constant that depends on the units used. For Qe > 0 
(like charges) the force is repulsive [Figure 4.3.7(a)], and for Qe < 0 (unlike charges) 
the force is attractive [Figure 4.3.7(b)]. Because the potential V is constant on the 
level surfaces of V , they are called equipotential surfaces. Note that the force field is 
orthogonal to the equipotential surfaces (the force field is radial and the equipotential 
surfaces are concentric spheres). A 


The next example shows that not every vector field is a gradient. 
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ia Figure 4.3.7 The vector fields 
~<~—e 
ä associated with (a) like charges 
pe 7 Te (Qe > 0), and (b) unlike charges 
y \ N (Qe « 0). 


» Show that the vector field V on IR? defined by V(x, y) = yi — xj 
is not a gradient vector field; that is, there is no C! function f such that 

a a 

f + af i 


V(x, y) = VAX, y) = i» 


SOLUTION Suppose that such an f exists. Then 0f/dx = y and 8f/8y = —x. 
Because these are C! functions, f itself must have continuous first- and second-order 
partial derivatives. But, 3?f/ðx 8y = —1, and 3?f/ðy ax = 1, which violates the 
equality of mixed partials. Thus, V cannot be a gradient vector field. a 


Conservation of Energy and Escaping the Earth’s Gravitational Field 


Consider a particle of mass m moving in a force field F that is a potential field. That is, 
assume F = — VV fora real-valued function V, and that the particle moves according 
to F = ma. Thus, if the path is r(t), then 

mi(t) = —VV(r(t)). (1) 


A basic fact about such motion is the conservation of energy. The energy E of the 
particle is defined to be the sum of the kinetic and potential energies, defined as 


E = Fmi + Veo). Q 


The principle of conservation of energy states that if Newton's second law holds, 
then E is independent of time; that is, dE/dt = 0. The proof of this fact is a simple 
calculation; we use equation (2), the chain rule, and equation (1): 


E 
a = mr-t¥+(VV)-F 


=r-(-VV +VV)=0. 
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Escape Velocity 


Flow Lines 


As an application of conservation of energy, we compute the velocity required for a 
rocket to escape the earth's gravitational influence. Assume the rocket has mass m 
and is at a distance Ro from the center of the earth (or another planet) when its escape 
velocity ve has been reached, and that it will coast thereafter. The energy at this time is 


(3) 


(4) 


where v is the velocity and R is the distance from the center of the earth (or the other 
planet). What we mean by the term escape velocity is that ve is chosen such that the 
rocket gets to great distances, at which time it is barely moving. That is, v is close 
to zero and R is very large. Thus, from equation (4), we see that E — 0 and hence 
Eo = 0; solving Eo = 0 for v, using equation (3) gives: 


2MG 
Ve = " 


Ro 


Now GM/ R? is exactly g, the acceleration due to gravity at the distance Ro from the 
center of the planet. Thus, we can write: 


Ve = J2g Ro. 


For the earth, if the escape velocity were to be achieved at the surface of the earth 
(of course, this is a bit unrealistic), this would give 


v, = V2 -9.8 m/s^- 6,371,000 m = 11,127 m/s. 


However, this is a good approximation to the velocity that a satellite in low earth 
orbit needs in order to escape the earth's gravitational field. 


An important concept related to general (not necessarily gradient) vector fields is that 
of a flow line, defined as follows. 


Flow Lines If F is a vector field, a flow line for F is a path c(t) such that 
c(t) = F(c(t)). 
That is, F yields the velocity field of the path e(t). 
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In the context of Example 1, a flow line is the path followed by a small particle 
suspended in the fluid (Figure 4.3.8). Flow lines are also appropriately called 
streamlines or integral curves. 


Velocity vector Flow line 


Figure 4.3.8 The velocity vector of a fluid is 
tangent to a flow line. 


Geometrically, a flow line for a given vector filed F is a curve that threads its 
way through the domain of the vector field in such a way that the tangent vector of 
the curve coincides with the vector field, as in Figure 4.3.9. 


| Í Figure 4.3.9 A flow line threading its way through 
wo. — a vector field in the plane. 


A flow line may be viewed as a solution of a system of differential equations. 
Indeed, we can write the definition e’(t) = F(e(t)) as 


x'(t) = PO), yO), 20), 
»'(t) = ECO, y), 2M), 
Z(t) = R(x(t), y(t), 2), 


where c(t) = x(t)i + y(t)j + z(t)k, and where 
F = Pi + Qj + Rk. 


One learns about such systems in courses on differential equations, but we are not 
presuming such a course has been taken. 
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Show that the path e(t) = (cos t, sin ¢) is a flow line of the vector 
field F(x, y) 2 —yi 4- xj. Can you find others? 


SOLUTION We must verify that c'(f) = F(c(f)). The left side is (—sin t)i + 
(cos t)j, while the right side is F(cost, sint) = (—sin t)i + (cos£)j, so we have a 
flow line. As suggested by Figure 4.3.3, the other flow lines are also circles. They 
have the form 


c(t) = (r cos(t — to), r sin (t — to)) 


for constants r and tọ. A 


In many cases, explicit formulas for flow lines are not available, so one must 
resort to numerical methods. Figure 4.3.10 shows some output from a program that 
computes flow lines numerically and plots them on the screen. 


Figure 4.3.10 Computer-generated integral curves of 
the vector field F(x, y) = (sin y)i + (x? — y)j. This 
figure was created using 3D-XplorMath, available from 
Richard Palais’ Web site at rsp.math.brandeis.edu/ 
3D-XplorMath. 
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The Field Concept 


The concept of a “field,” such as a vector field, has had an enormous impact 
on the development of conceptual frameworks for physics and engineering. It 
is truly one of the great breakthrough ideas in the history of human thought. 
It is the notion that allows one to describe, in a systematic way, influences on 
objects and between objects that are spatially separated. 

The idea of a field began with Newton'sconcept of the gravitational 
field. In this case, the gravitational field describes the attractive influence 
of one body or group of bodies on one another. Similarly, the electric field 
produced by a charged object or group of objects creates, according to 
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Coulomb'slaw, a force on another charged object. Using vector fields to 
describe these sorts of forces has led to a deeper understanding of attractive 
and repulsive forces in nature. 

However, it was the monumental discovery of the Maxwell field 
equations, which describe the propagation of electromagnetic energy, that 
cemented the concept of "field"in scientific thought. This example is 
particularly interesting because these fields can propagate. The contrast 
between the electromagnetic field that can propagate and the gravitational 
field that involves instantaneous action at a distance has caused great interest 
among philosophers of science. 

Einstein's idea is that gravitation can be described in terms of the metric 
properties of space-time and that in this theory the associated field can also 
propagate, just like the electromagnetic field, thus providing profound 
philosophical evidence that Einstein's version of gravity may be correct. 
These ideas have also led to modern efforts to detect gravitational waves. For 
a further discussion of Einstein's work, see Section 7.7. 

The idea of a field is also used in engineering to describe elastic systems 
and interesting microstructural properties of materials. In modern theoretical 
physics, the field concept is used to describe elementary particles and is 
central to attempts by modern theoretical physicists to unify gravity with the 
quantum mechanical physics of elementary particles. It is impossible to 
imagine a modern theoretical framework that does not incorporate some sort 
of field concept as a central ingredient. 


EXERCISES 


In Exercises 1 to 8, sketch the given vector field or a small multiple of it. 
1. FQ.) = (2,2) 
2. F(x, y) = (4, 0) 
3. Fo, y) = (x, y) 


4. F(x, y) = (=x, y) 


tA 


FG, y) = (2y, x) 

6. F(x, y) = (y, 22x) 

mye a ao 
"NG Stee 


8. Fi = » x 
id rr or 
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In Exercises 9 to 12, sketch a few flow lines of the given vector field. 
9. F(x, y) 2 (y, —x) 

10. F(x, y) = (x, —y) 

1. F(x, y) = (x, x?) 

12. F(x, y, z) = (y, —x, 0) 


In Exercises 13 to 16, show that the given curve c(t) is a flow line of the given velocity vector 
field F(x, y, z). 


13. e(t) = (e*, log |t|, 1/2), t 4 0; F(x, y, z) = (2x, z, —z?) 
14. e(t) = ((02,2t — 1, Vt), t > 0; F(x, y, z) = (y + 1,2, 1/22) 


15. c(t) = (sint, cost, e); F(x, y, z) = (y, —x, zZ) 
Do, Y. 4 2 
16. e) = (5.6. 7 j Fey = C32.» =°) 


17. Show that it takes half as much energy to launch a satellite into an orbit just above the 
earth as it does to escape the earth. (Ignore the rotation of the earth.) 


18. Let e(t) be a flow line of a gradient field F = —V V. Prove that V (c(t)) is a decreasing 
function of t. 


19. Suppose that the isotherms in a region are all concentric spheres centered at the origin. 
Prove that the energy flux vector field points either toward or away from the origin. 


20. Sketch the gradient field — V V for V(x, y) = (x + y)/(x? + y?) and the equipotential 
surface V = 1. 


4.4 Divergence and Curl 


For each of the divergence and curl operations, we will make use of the del operator, 
defined by 


V=i : +j : +k : 
ab an y az" 
For functions of one variable, taking a derivative can be thought of as an operation or 
process; that is, given a function y = f(x), its derivative is the result of operating on 
y by the derivative operator d /dx. Similarly, we can write the gradient as 


fa 3 af af 
v7 (ini) "ox t Yay 
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for functions of two variables, and 


.8 .8 ə of f of 
V t +k =1- k 
f (i day x) "ox lay + Kaz 


for three variables. In operational terms, the gradient of f is obtained by taking the 
V operator and applying it to f. 


Definition of Divergence 


We define the divergence of a vector field F by taking the dot product of V with F. 


Divergence If F = Fiji 4- Fj + F3k, the divergence of F is the scalar field 


F; $ dF in OF; 
ax dy üz ` 


a 
div F = V -F = 


Similarly, if F = (Fi, ..., F;) is a vector field on R”, its divergence is 
"ðR; OF, ah OF, 
div F E 
m P» Ox; n Ox | Oy 


SINY JBF] Compute the divergence of 


F =x yi+2j T xyzk. 


SOLUTION 


a a a 
div F = —(x?y) + —(z) + —(xyz) = 2xy -0--xy =3xy. A 
Ox ay Oz 


Interpretation 


The divergence has an important physical interpretation. If we imagine F to be the 
velocity field of a gas (or a fluid), then div F represents the rate of expansion per 
unit volume under the flow of the gas (or fluid). If div F <0, the gas (or fluid) is 
compressing. For a vector field F(x, y) = Fii + Fj on the plane, the divergence 


vus, S 
ax ay 


measures the rate of expansion of area. 
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This interpretation is explained graphically, as follows. Choose a small region W 
about a point xo. For each point x in W, let x(t) be the flow line emanating from x. 
The set of points x(t) describe how the set W flows after time t (see Figure 4.4.1). 


Figure 4.4.1 Flowing a region W along 
the flow lines of a vector field. 


Call the region that results after time ¢ has elapsed W(t), and let V(t) be its 
volume (or area in two dimensions). Then the relative rate of change of volume is the 
divergence; more precisely, 


1 i 
»55 KO div F(xo), 


with the approximation being more exact as W shrinks to xo. A direct proof of this is 
given in the Internet supplement, but a more natural argument is given in Chapter 8, 
in the context of the integral theorems of vector calculus. 


JC WIIZEDPE Consider the vector field in the plane given by V(x, y) = xi. 
Relate the sign of the divergence of V with the rate of change of areas under the 
flow. 


SOLUTION We think of V as the velocity field of a fluid in the plane. The vector 
field V points to the right for x > 0 and to the left if x < 0, as we see in Figure 4.4.2. 
The length of V gets shorter toward the origin. As the fluid moves, it expands (the area 
of the shaded rectangle increases), so we expect div V > 0. Indeed, div V = 1. A 


DWI BS} The flow lines of the vector field F = xi + yj are straight lines 
directed away from the origin (Figure 4.4.3). 
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| 
| 


| 


an — w ` m H 
| ES |] Figure 4.4.2 This fluid is expanding. 
» x 


y 
eh UMS ON | PB og W^ Vw 
Ez m s | Jo oum e Lom 
Figure 4.4.3 The vector 
me < > _—p>—_——>—__—_—_»>— » x field F(x, y) = xi + yj. 
Fo E E 2 LA a 
"a m A. |a e s 


If these flow lines are those of a fluid, the fluid is expanding as it moves out from the 
origin, so div F should be positive. In fact, 


9&0 8710097] Consider the vector field F = —xi — yj. Here the flow lines point 
toward the origin instead of away from it (see Figure 4.4.4). Therefore, the fluid is 
compressing, so we expect (div F) < 0. Calculating, we see that 


V-F (ence) 1-1 2«0. A 
ax oy 
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b 


W 
Su N JP n Figure 4.4.4 The vector field 
Zip 


> * F(x, y) = —xi— yj. 


DISS! As we saw in the last section, the flow lines of F = —yi+ xj 
are concentric circles about the origin, moving counterclockwise (see Figure 4.4.5). 
From this figure, it appears that the fluid is neither compressing nor expanding. This 
is confirmed by calculating 


a a 
V-F (—y)+ —(@)=0+0=0. A 
ax oy 


y 
f ee, ks NAN 

/ / / dial ad e N \ Figure 4.4.5 The vector field 
l l / | # nN \ NN F, y) neun pies 
\ \ Yours 4 ff / i Í divergence. 

XN E e a E i / 

A8 oer pg 


DANAA Some flow lines of F = xi — yj are shown in Figure 4.4.6. Here 
our intuition about expansion or compression is less clear. However, it is true that the 
shaded regions shown have the same area, and we calculate that 


a a 
V-F=—27+—(y=14+(-D=0. a 
Ox dy 
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y Fluid particles move from 


shaded region to shaded 


gy d 

"A E E ^p 
are the same. 

TN QE 

we x ¥ 


SON 


X 
rE x y ~~’ am 
<—__<—_~<—_<— -> » x 
Sa CON WX uv dan ow 


Figure 4.4.6 The vector field F(x, y) = xi — yj. 


Curl 


To calculate the curl, the second basic operation performed on vector fields, we take 
the cross product of V with F. 


Curl of a Vector Field If F = Fi + Foj + Fk, the curl of F is the vector 


field 
i j k 
ð 8 8 
1F=VxF=/— — — 
ou i ax dy Oz 
Fi Fy Fy 


OF; OF; OF, OF: OF, OF 
3 2 Jia 1 3 ja 2 3 
oy az az ax Ox ay 


If we write F = Pi + Qj + Rk, which is alternative notation, the same formula 
for the curl reads 


i j k 
curl F = KR EA 
ax dy Oz 
P« O PR 
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Io WIE UNE Let F(x, y, z) = xi 4 xyj + k. Find V x F. 
SOLUTION We use the preceding formula: 


VxF= 


i j k 
9 8 8 
ax dy dz 
x xy 1 


= (0 — 0)i — (0 — 0)j + (y — O)k. 


Thus V x F = yk. A 


aB Find the curl of xyi — sin zj + k. 


SOLUTION Letting F = xyi — sin zj + k, 


i j k 
geroji S 3 
ax dy dz 
xy —sinz 1 
0 ə aoa ð ə 
=| dy dzli—|dx əðz|j+|ə3x ay |k 
—sinz | xy 1 xy —sinz 


= coszi— xk. A 


Unlike the divergence, which can be defined in IR" for any n, we define the 
curl only in three-dimensional space (or for planar vector fields, regarding their third 
component as zero). 


'The Curl and Rotations 


The physical significance of the curl will be discussed in Chapter 8, when we study 
Stokes' theorem. However, we can now consider a specific situation, in which the curl 
is associated with rotations. 


Consider a solid rigid body B rotating about an axis L. The rota- 
tional motion of the body can be described by a vector w along the axis of rotation, 
the direction being chosen so that the body rotates about w, as in Figure 4.4.7. We 
call w the angular velocity vector. The length w = ||w|| is taken to be the angular 
speed of the body B, that is, the speed of any point in B divided by its distance from 
the axis L of rotation. The motion of points in the rotating body is described by the 
vector field v whose value at each point is the velocity at that point. To find v, let Q 
be any point in B and let o be the distance from Q to L. 

Figure 4.4.7 shows that o = ||r|| sin 60, where r is the vector whose initial point 
is the origin and whose terminal point is Q and 6 is the angle between r and the axis 
L of rotation. The tangential velocity v of Q is directed counterclockwise along the 
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Figure 4.4.7 The velocity v and angular velocity w of a 
rotating body are related by v = w x r. 


tangent to a circle parallel to the xy plane with radius a and has magnitude 
lvi = wa = |r|] sin 6 = ||w|||Ir|] sin 6. 


The direction and magnitude of v imply that v = w x r. Selecting a coordinate system 
in which L is the z axis, we can write w = wk and r = xi + yj + zk. Thus, 


vV=wxr=—oyit wxj, 
and so 
i j k 
aaa 
curl v 2@k = 2w 
ax dy az 
—wy wx 0 


Hence, for the rotation of a rigid body, the curl of the velocity vector field is a vector 
field whose value is the same at each point. It is directed along the axis of rotation 
with magnitude twice the angular speed. A 


The Curl and Rotational Flow 


If a vector field represents the flow of a fluid, then the value of V x F at a point is 
twice the angular velocity vector of a rigid body that rotates as the fluid does near 
that point. In particular, V x F = 0 at a point P means that the fluid is free from rigid 
rotations at P; that is, it has no whirlpools. Another justification of this idea depends 
on Stokes’ theorem from Chapter 8. However, we can say informally that curl F = 0 
means that if a small rigid paddle wheel is placed in the fluid, it will move with the 
fluid but will not rotate around its own axis. Such a vector field is called irrotational. 
For example, it has been determined from experiments that fluid draining from a tub 
is usually irrotational except right at the center, even though the fluid is “rotating” 
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around the drain (see Figure 4.4.8). In Example 10, the flow lines of the vector field V 
are circles about the origin, yet we show that the flow is irrotational. Thus, the reader 
should be warned of the possible confusion the word "irrotational" can cause. 


Figure 4.4.8 Looking at a paddle 
wheel from above a moving fluid. 
The velocity field V(x, y, z) — 
ee] (yi — xj)/G? + y?) is irrotational; 
the paddle wheel does not rotate 


around its axis w. 


DGA Verify that the vector field 


yi - xj 
Væ, y,z)= Mas 
is irrotational when (x, y) Æ (0, 0) (i.e., except where V is not defined). 


SOLUTION The curl is 


i i k 
a a a 

VxV=| ax ðy az 
y =x 


x24 y2 x24 y2 


a — a y 
= 0i 4- 0j -— k 
m reca) tess) 


Ge? + y?) 42x? | —(x? + y?) +29? 
(x2 + y2* ? (x2 + y2)? 


Ke a 


Gradients are Curl Free 


The following identity is a basic relation between the gradient and curl, which should 
be compared with the fact that for any vector v, we have v x v = 0. 


4.4 Divergence and Curl 303 


'THEOREM 1: Curl of a Gradient For any C? function f, 


V x (Vf) — 0. 


That is, the curl of any gradient is the zero vector. 


PROOF Because Vf = (0f/dx, 8f /8y, 8f/8z) we have, by definition, 
j k 

ð ə 

VxVf=]ļ|ðx dy az 
af of 


a a? 8? a 2? a? 
FO (OE S fl OF EI. 
ðyðz dzoy Ozüx dx dz ðxðy  OyOx 


Each component is zero because of the equality of mixed partial derivatives. m 


The converse to this theorem (a vector field with zero curl is a gradient, under 
suitable hypotheses) will be discussed in Chapter 8. 


PO ess Let V(x, y, z) = yi — xj. Show that V is not a gradient field. 


SOLUTION If V were a gradient field, then it would satisfy curl V — 0 by The- 
orem 1. But 


so V cannot be a gradient. A 


Scalar Curl 


There is an operation on vector fields in the plane that is closely related to the curl. 
If F = P(x, y)i + Q(x, y)j is a vector field in the plane, it can also be regarded as a 
vector field in space for which the k component is zero and the other two components 
are independent of z. The curl of F then reduces to 


vxr= (22 - 
ax oy 
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and always points in the k direction. The function 


of x and y is called the scalar curl of F. 


DANIJA Find the scalar curl of V(x, y) = —y?i+ xj. 


SOLUTION The curl is 


i jk 
à 3 8 
Mx |= Ss E cls ape 
i mou 5995 
-y x 0 


so the scalar curl, which is the coefficient of k, is 1 +2y. 4 


Curls are Divergence Free 


A basic relation between the divergence and curl operations is given next. 


THEOREM 2: Divergence of a Curl For any C? vector field F, 


div curl F = V - (V x F) = 0. 


That is, the divergence of any curl is zero. 


As with the curl of a gradient, the proof rests on the equality of the mixed partial 
derivatives. The student should write out the details. A converse will be discussed in 
Chapter 8. 


DAAE Show that the vector field V(x, y, z) = xi + yj + zk cannot be 
the curl of some vector field F; that is, there is no F with V = curl F. 


SOLUTION If this were so, then div V would be zero by Theorem 2. But 


ox dy az 
WV pap ys 
div V T + ay + az È 


so V cannot be curl F for any F. A 
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Laplacian 


The Laplace operator V?, which operates on functions f, is defined to be the diver- 
gence of the gradient: 


Gf Pf wf 
V fy. -— . 
Pes ax? $ ay? » az? 


This operator plays an important role in many physical laws, as we have mentioned 
in Section 3.1. 


VIRDEE] Show that V? f = 0 for 


1 1 


x, y, Z) = —————À-- 
feo MAU 


and (x, y. z) (0, 0, 0), 
where r = xi + yj + zk andr = |[r||. 
SOLUTION The first derivatives are 


of =x af =y af —z 
Ox (x? +y + 22)3/2 ? ay (x? +y? + 22)3/2* az (x? TX 4z2yy2 r 


Computing the second derivatives, we find that 


Bf 3x? 1 
ax? FPE (x2 + yp? H 
af 35? 1 
ay? (x? ES y? + z2nm (x? +y? + zp? 
af 3z? 1 
az? (x? y? ziyn (x? + y? $ z2yn P 


Thus, 


Sf Of . af 3(x? + y? +27) 3 
ax? ay? ' ez? (x? +y? 3E. z2)5/2 (x2 Ly? + zyn 
3 3 = 
GEP +P (x2 + y2 +2232 
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Vector Identities 


We now have these basic operations on hand: gradient, divergence, curl, and the 
Laplace operator. The following box contains some basic general formulas that are 
useful when computing with vector fields. 


Basic Identities of Vector Analysis 
|l. V(£*-g-Vf-*Vg 
Vicf) = cV f, for a constant c 
V(fg) = fVg t gVf 
V(f/g) = (gV f — f Vg)/g^, at points x where g(x) # 0 
div (F + G) = div F + div G 
curl (F 4- G) — curl F 4- curl G 
div (f F) = fdivF-- F-Vf 
div (F x G) = G-curl F — F - curl G 
div curl F = 0 
10. curl (fF) = fcul F+Vf xF 
ll. cud V£ —0 
12. Vi(fg)— f Vg gV^ f XVf-Vg) 
13. div(Vf x Vg) 20 
14. div(fVg—-gVf)— f Vg -gV^f 


3999.53 sv pA. gps (m JS 


DOVA Prove identity 7 in the preceding box. 
SOLUTION The vector field f F has components f F;, for i = 1,2, 3, and so 


a a a 
dv (SP = SUF) + 5: UP) UP. 


However, (0/80x)(f F1) = fdF\/dx + F10f/0x by the product rule, with similar ex- 
pressions for the other terms. Therefore, 


3F 3R P 
di F r ni 
wE (FZ ay ^ Oz 


=f(V-F)+F-Vf. A 


a a a 
)enzenifenz 
Ox oy oz 


Let us use these identities to redo Example 14. 
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LASIA Show that for r 4 0, V7(1/r) = 0. 


SOLUTION As in the case of the gravitational potential, V(1/r) = —r/r?. In 
general, V(r”) = nr"—?r (see Exercise 30). By the identity V - (f F) = fV-F+ 
Vf -F, we get 


eL A d Note 


Divergence and Curl 


William Rowan Hamilton, in his investigation of quaternions (discussed in 
Section 1.3) introduced the del operator, defined formally as 


a a a 
MI IJ 


Ox oy 0z 


Hamilton firmly believed in the significance of this operator. If 
f(x, y, 2) isa scalar function on R?, then “multiplication” by V gives the 
gradient of f: 


which, of course, gives the direction of steepest ascent (see Section 2.6). If 
V(x,y,2) = Vly zi + Valx.y,2)j + Va(x,9,2)k 
is a vector field, then the “quaternionic multiplication” of V with V yields 


VV = -div V+ curl V. 


Thus, what we now call the divergence of V is the negative of the scalar part 
of this product, and curl V is the vector part (c.f. the quaternion discussion in 
Section 1.3). 

As far as we are aware, Hamilton never gave a physical interpretation 
of divergence and curl, but he surely believed that, as a consequence of his 
faith in them, they must have an important physical interpretation. His faith 
in his mathematical formalism was justified, but a physical explanation of 
divergence and curl had to wait for James Clerk Maxwell’s Treatise on 
Electricity and Magnetism. Here, Maxwell used both the divergence and the 
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curl in his equations for the interaction of electric and magnetic fields (the 
Maxwell equations are discussed in Chapter 8). 

Curiously, Maxwell referred to divergence as convergence and to curl as 
rotation, a term still used in the literature. It was Josiah Gibbs (Figure 4.4.9) 
who renamed convergence and rotation as the more familiar terms we use 
today—divergence and curl. 


Figure 4.4.9 Josiah Willard 
Gibbs (1839-1903). 


Maxwell gave a physical interpretation of the divergence using the 
Gauss divergence theorem, as we do in Section 8.4. His physical 
interpretation of the curl as a rotation was rather brief. Gibbs provided a 
more elementary interpretation of divergence, as we do in this section. In the 
spirit of Leibniz (who believed in infinitesimal quantities dx,dy,dz), Gibbs 
imagined placing a small cube of dimensions dx by dy by dz in a fluid. The 
faces of this cube have areas dx dy, dy dz, and dx dz. 

At this point, students may be interested to hear Gibbs through the 
words of his student E. B. Wilson: 


Consider the amount of fluid which passes through those faces of 
the cube which are parallel to the YZ plane, i.e., perpendicular to the 
X axis [see Figure 4.4.10]. 

The normal to the face whose x coordinate is the lesser, that is, 
the normal to the left-hand face of the cube is —i. The flux of 
substance through this face is 


—i* V(x, y, z2) dy dz. 
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Cube with faces 


dy dz —> i dy dz parallel to the YZ plane. 


(x, y, z) (x dx, y, z) 


The normal to the opposite face, the face whose x coordinate is 
greater by the amount dx, is +i, and the flux through it is therefore 


i- V(x - dx, y,z)dydz=i- [ves a s] dydz 
x 
= i-V(x,9,2)dyda-+i- SV dxdydz. 
x 


The total flux outward from the cube through these two faces is 
therefore the algebraic sum of these quantities. This is simply 


i acd)dic. 9 dodi 
Ox Ox 


In like manner the fluxes through the other pairs of faces of the cube 
are 


1:289 mus and gee ay de 
oy 0z 


The total flux out from the cube is therefore 


av av av 
i- — +j-— +k-— }dxdydz. 
( Ox E oy m ~) at 


| This is the net quantity of fluid that leaves the cube per unit time. 
| The quotient of this by the volume dx dy dz of the cube gives the 
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rate of diminution of density. This is 


av av 80V 0V 98V 
a +k:-— = ! + ó 
oy dz Ox ay dz 


220 y pc. 

V-V=i- 5 tj 

Because V - V thus represents the diminution of density or the 
rate at which matter is leaving a point per unit volume per unit time, 
it is called the divergence. Maxwell employed the term convergence to 
denote the rate at which fluid approaches a point per unit volume 
per unit time. This is the negative of the divergence. In the case that 
the fluid is incompressible, as much matter must leave the cube as 
enters it. The total change of contents must therefore be zero. For 
this reason, the characteristic differential equation that any 
incompressible fluid must satisfy is 


YMO, 


where V is the velocity of the fluid. This equation is often known as 
the hydrodynamic equation. It is satisfied by any flow of water, since 
water is practically incompressible. The great importance of the 
equation for work in electricity is due to the fact that according to 
Maxwell's hypothesis, electric displacement obeys the same laws as 
an incompressible fluid. If, then, D is the electric displacement, 


divD=V-D=0. 


Gibbs’ interpretation of curl was much like the one we gave in Example 9 for 
the rotation of a rigid body. Wilson remarks that an analysis of the meaning 
of curl for fluid motion was "rather difficult." It remains a bit elusive, even 
today, as can be seen from our discussion following Example 9. We provide 
another interpretation in Chapter 8. 


EXERCISES 


Find the divergence of the vector fields in Exercises 1 to 4. 
1. V(x, y, z) = evi — ej --e"k 
2. V(x, y, z) = yzi + xzj + xyk 
3. V(x, y, z) = xi + (y + cosx)j + (z+ e?)k 
4. V(x. y, z) = xi + (x 4 yYjo- (x +y 4-zyk 


5. Figure 4.4.11 shows some flow lines and moving regions for a fluid moving in the plane 
field velocity field V. Where is div V > 0, and also where is div V < 0? 


4.4 Divergence and Curl 


>~ 


=P the plane. 


6. Let V(x, y, z) = xi be the velocity field of a fluid in space. Relate the sign of the 


divergence with the rate of change of volume under the flow. 
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x 
Qv, = ; ee 
e Figure 4.4.11 The flow lines of a fluid moving in 


7. Sketch a few flow lines for F(x, y) — yi. Calculate V - F and explain why your answer is 


consistent with your sketch. 


8. Sketch a few flow lines for F(x, y) = —3xi — yj. Calculate V - F and explain why your 


answer is consistent with your sketch. 

Calculate the divergence of the vector fields in Exercises 9 to 12. 
9. F(x, y) = x?i — x sin (xy)j 

10. F(x, y) 2 yi — xj 

11. F(x, y) = sin (xy)i — cos (x?y)j 

12. F(x, y) 2 xe'i — [y/G + »)li 

Compute the curl, V x F, of the vector fields in Exercises 13 to 16. 

13. F(x, y.z) = xi + yj + zk 

14. F(x, y, z) = yzi + xzj + xyk 

15. F(x, y, z) = (x? + y? 4 z2)i + 4j + 5k) 


yzi — xzj - xyk 
16. F(x, y, z) = ———— 
2) x? + y? +2? 
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Calculate the scalar curl of each of the vector fields in Exercises 17 to 20. 


17. 


18. 


19. 


20. 


F(x, y) = sin xi + cos xj 
FQ, y) = yi - xj 
F(x, y) = xyi + Q? — yj 


F(x, y) 7 xi *- yj 


Verify that V x (V f ) = 0 for the functions in Exercises 21 to 24. 


21. f(x, yz) x + y? + 22 
22. f(x,y,z) =xy - yz xz 
23. f(x, yz) = 1/0? y? +2) 
24. f(x,y,z) =x? y? x. yz 
25. Show that F = y(cos x)i + x(sin y)j is not a gradient vector field. 
26. Show that F = (x? + y?)i — 2xyj is not a gradient field. 
27. Prove identity 10 in the list of vector identities. 
28. Suppose that V - F = 0 and V - G = 0. Which of the following necessarily have zero 
divergence? 
(a) F+G (b) Fx G 
29. Let F = 2xz?^i + j + y?zxk and f = x?y. Compute the following quantities: 
(a) Vf (D V xF (c) Fx Vf (d) F-(V f) 
30. Let r(x, y, z) = (x, y. z) andr = x? + y? + 2? = ||r]|. Prove the following identities. 
(a) V(1/r) = —r/r?, r + 0; and, in general, V(r") = nr"~?r and V(logr) = r/r?. 
(b) VA(1/r) = 0, r 4 0; and, in general, V?r" = n(n + 1)r"?. 
(c) V - (r/r?) = 0; and, in general, V - (r^r) = (n + 3)r^. 
(d) V x r = 0; and, in general, V x (r^r) = 0. 
31. Does V x F have to be perpendicular to F? 
32. Let F(x, y, z) = 3x?yi + (X + yj. 


(a) Verify that curl F — 0. 
(b) Find a function f such that F — V f. (Techniques for constructing f in general are 


given in Chapter 8. The one in this problem should be sought by trial and error.) 
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33. Show that the real and imaginary parts of each of the following complex functions 
form the components of an irrotational and incompressible vector field in the plane; 


here i = y=]. 
(a) (x — iy? (b) (x — iy? (c) e^ = e*(cosy — i sin y) 


REVIEW EXERCISES FOR CHAPTER 4 


For Exercises 1 to 4, at the indicated point, compute the velocity vector, the acceleration 
vector; the speed, and the equation of the tangent line. 


1. c(t) 2 (P + 1, e^, cos(t/2)), att = 1 
2. e(t) = (P? — 1, cos (t°), tt), at t = fr 


3. c(t) = (e', sint, cost), att — 0 


2 


f 
4. et) = Lob atro 2 


5. Calculate the tangent and acceleration vectors for the helix e(t) = (cost, sin, t) at 
t = m /A. 


6. Calculate the tangent and acceleration vector for the cycloid e(t) = (t — sint, 1 — cost) 
at t = 2/4 and sketch. 


7. Leta particle of mass m move on the path e(t) = (t°, sin t, cos t). Compute the force 
acting on the particle at t = 0. 


8. (a) Let c(r) bea path with ||c(t)|| = constant; that is, the curve lies on a sphere. Show 
that c'(t) is orthogonal to c(7). 
(b) Let c be a path whose speed is never zero. Show that c has constant speed if and only 
if the acceleration vector c" is always perpendicular to the velocity vector c'. 


9. Express the arc length of the curve x? = y? = z? between x = | and x = 4 as an 
integral, using a suitable parametrization. 


10. Find the arc length of e(t) = ti + (log t)j + 2/2tk for 1 < t <2. 


11. A particle is constrained to move around the unit circle in the xy plane according to the 
formula (x, y, z) = (cos (/?), sin (t°), 0), t > 0. 


(a) What are the velocity vector and speed of the particle as functions of t? 

(b) At what point on the circle should the particle be released to hit a target at (2, 0, 0)? 
(Be careful about which direction the particle is moving around the circle.) 

(c) At what time ¢ should the release take place? (Use the smallest ż > 0 that will work.) 

(d) What are the velocity and speed at the time of release? 

(e) At what time is the target hit? 
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12. A particle of mass m moves under the influence of a force F = —kr, where k is a 
constant and r(¢) is the position of the particle at time f. 


(a) Write down differential equations for the components of r(t). 
(b) Solve the equations in part (a) subject to the initial conditions r(0) = 0, 
r'(0) =2j+k. 
13. Write the curve described by the equations x — 1 = 2y + 1 = 3z + 2 in parametric form. 


14. Write the curve x = y? = z? + 1 in parametric form. 


15. Show that e(t) = (1/(1 — t), 0, e'/(1 — r)) is a flow line of the vector field defined by 
F(x, y, z) = (x^, 0, 2(1 + x)). 


16. Let F(x, y) = f(x? + y?)[—yi+ xj] for a function f of one variable. What equation 
must g(t) satisfy for 


c(t) = [cos g()}i + [sin g(7)]i 
to be a flow line for F? 
Compute V - F and V x F for the vector fields in Exercises 17 to 20. 
17. F = 2xi + 3yj + 4zk 
18. F = x?i + y!jbrzk 
19. F= (x + y)i+ (y +z)j+(z+x)k 
20. F = xi + 3xyj + zk 


Compute the divergence and curl of the vector fields in Exercises 21 and 22 at the points 
indicated. 


21. F(x, y, z) = yi + zj + xk, at the point (1, 1, 1) 


22. F(x, y, z) = (x + yyi + (sin xy)j + (cosxyz)k, at the point (2, 0, 1) 
Calculate the gradients of the functions in Exercises 23 to 26, and verify that V x V f = 0. 
23. f(x,y) =e” + cos(xy) 


A an 


24. f(x,y)— Fair 


25. f(x, y) =e" — cos (xy?) 


26. f(x, y) = tan! (x? + y?) 
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27. (a) Let f(x, y, z) = xyz?; compute V f. 

(b) Let F(x, y, z) = xyi + yzj + zyk; compute V x F. 

(c) Compute V x (fF) using identity 10 of the list of vector identities. Compare with a 
direct computation. 


28. (a) Let F = 2xye7i + e*x?j + (x? ye + z?)k. Compute V - F and V x F. 
(b) Find a function f(x, y, z) such that F = V f. 


29. Let F(x, y) = f(x? + »?)[-yi + xj], as in Exercise 16. Calculate div F and curl F and 
discuss your answers in view of the results of Exercise 16. 


30. Leta particle of mass m move along the elliptical helix c(t) — (4 cos t, sin t, t). 


(a) Find the equation of the tangent line to the helix at t = 7/4. 

(b) Find the force acting on the particle at time t = 77/4. 

(c) Write an expression (in terms of an integral) for the arc length of the curve e(t) 
between t = 0 and t = 7/4. 


31. (a) Let g(x, y, zZ) = x? + Syz + z? and let A(w) be a function of one variable such that 
h'(1) = 1/2. Let f — h o g. Starting at (1, 0, 0), in what directions is f changing at 50% of 
its maximum rate? 

(b) For g(x, y, z) = x? + 5yz + 2°, calculate F = Vg, the gradient of g, and verify 
directly that V x F — 0 at each point (x, y, z). 


32. (a) Write in parametric form the curve that is the intersection of the surfaces 
x! y! +z —3andy =l. 

(b) Find the equation of the line tangent to this curve at (1, 1, 1). 

(c) Write an integral expression for the arc length of this curve. What is the value of this 
integral? 


33. In meteorology, the negative pressure gradient G is a vector quantity that points from 
regions of high pressure to regions of low pressure, normal to the lines of constant pressure 
(isobars). 


(a) In an xy coordinate system, 


Write a formula for the magnitude of the negative pressure gradient. 

(b) Ifthe horizontal pressure gradient provided the only horizontal force acting on the 
air, the wind would blow directly across the isobars in the direction of G, and for a given 
air mass, with acceleration proportional to the magnitude of G. Explain, using Newton's 
second law. 

(c) Because of the rotation of the earth, the wind does not blow in the direction that part 
(b) would suggest. Instead, it obeys Buys—Ballot’s law, which states: “If in the Northern 
Hemisphere, you stand with your back to the wind, the high pressure is on your right and the 
low pressure is on your left." Draw a figure and introduce xy coordinates so that G points in 
the proper direction. 

(d) State and graphically illustrate Buys-Ballot's law for the Southern Hemisphere, in 
which the orientation of high and low pressure is reversed. 
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34. A sphere of mass m, radius a, and uniform density has potential u and gravitational force 
F, at a distance r from the center (0, 0, 0), given by 


3m mr? m (<a) 

u - j -—r r € a) 
2a | 2: aè 
m m 

u= —, F=-<sr (r >a). 
r r 


Here, r = |Ir||, r = xi + yj + zk. 


(a) Verify that F = Vu on the inside and outside of the sphere. 

(b) Check that u satisfies Poisson’s equation: 3?u/3x? + 0?u/8y? + 9?u/0z? = 
constant inside the sphere. 

(c) Show that w satisfies Laplace's equation: 9^u/8x? + 9u/8y? + 3?u/3z? = 0 
outside the sphere. 


35. A circular helix that lies on the cylinder x? + y? = R? with pitch p may be described 
parametrically by 


x = Rcos6, y= Rsin6, z= pð, 0-0. 


A particle slides under the action of gravity (which acts parallel to the z axis) without friction 
along the helix. If the particle starts out at the height zo > 0, then when it reaches the height z 
along the helix, its speed is given by 


E o Jee, 


dt 


where s is arc length along the helix, g is the constant of gravity, t is time, and 0 < z < zo. 


(a) Find the length of the part of the helix between the planes z — zo and 
z-—z,0rz < Zo. 
(b) Compute the time Tù it takes the particle to reach the plane z = 0. 


36. A sphere of radius 10 centimeters (cm) with center at (0, 0, 0) rotates about the z axis 
with angular velocity 4 in such a direction that the rotation looks counterclockwise from the 
positive z axis. 


(a) Find the rotation vector w (see Example 9, in Section 4.4). 
(b) Find the velocity v = w x r when r = 54/2(i — j) is on the “equator.” 
(c) Find the velocity of the point (0, 54/3, 5) on the sphere. 


37. Find the speed of the students in a classroom located at a latitude 49°N due to the 
rotation of the earth. (Ignore the motion of the earth about the sun, the sun in the galaxy, 
etc.; the radius of the earth is 3960 miles.) 


5) 
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Double and Triple 


Integrals 


It is to Archimedes himself (c. 225 B.c.) that we owe the nearest approach 
to actual integration to be found among the Greeks. His first noteworthy 
advance in this direction was concerned with his proof that the area of a 
parabolic segment is four thirds of the triangle with the same base and 
vertex, or two thirds of the circumscribed parallelogram. 


QD. €. Smith, History of Mathematics 


L this chapter and the next we study the integration of real-valued functions of 
several variables; this chapter treats integrals of functions of two and three variables, 
or double and triple integrals. The double integral has a basic geometric interpretation 
as volume, and can be defined rigorously as a limit of approximating sums. We shall 
present several techniques for evaluating double and triple integrals and consider 
some applications. 


5.1 Introduction 


This section discusses some geometric aspects of the double integral, deferring a 
more rigorous discussion in terms of Riemann sums until Section 5.2. 


Double Integrals as Volumes 


Consider a continuous function of two variables f: R C R? — R whose domain R is 
arectangle with sides parallel to the coordinate axes. The rectangle R can be described 
in terms ofthe two closed intervals [a, b] and [c, d], representing the sides of R along 
the x and y axes, respectively, as in Figure 5.1.1. In this case, we say that R is the 
Cartesian product of [a, b] and [c, d] and write R = [a, b] x [c, d]. 
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Assume that f(x, y) > 0 on R, so that the graph ofz = f(x, y) isa surface lying 
above the rectangle R. This surface, the rectangle R, and the four planes x = a, x = 
b, y = c, and y = d form the boundary of a region V in space (see Figure 5.1.1). 


Graph of 
z=f(x,y) 


Figure 5.1.1 The region V in space is bounded by 
the graph of f, the rectangle R, and the four 
y vertical sides indicated. 


The problem of how to rigorously define the volume of V has to be faced, and 
we shall solve it in Section 5.2 by the classic method of exhaustion, or rather, in more 
modern terms, the method of Riemann sums. To gain an intuitive grasp of the double 
integral, we provisionally assume that the volume of a region has been defined. 


Double Integrals The volume of the region above R and under the graph of a 
nonnegative function f is called the (double) integral of f over R and is denoted 
by 


J| (e 2^. or [| 1e o 


| DJB (a) If fisdefinedby f(x, y) = k, where k isa positive constant, 
then 


J| 1e 24-6 - 9 -o. 

R 

because the integral is equal to the volume of a rectangular box with base R and 
height k. 

(b) If f(x, y) 2 1 — x and R = [0, 1] x [0, 1], then 


[[ 1992475. 


because the integral is equal to the volume of the triangular solid shown in Figure 
5.1.2. A 


[SWISS] Suppose z= f(x, y) = x? + y? and R = [-1, 1] x [0, 1]. Then 
the integral ff, (x? + y?) dx dy is equal to the volume of the solid sketched in 
Figure 5.1.3. We shall compute this integral in Example3. A 
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z 


Figure 5.1.2 Volume under the graph z = 1 — x and 
over R = [0, 1] x [0, 1]. 


(0, 1, 0) 


(1, 1, 0) 


Figure 5.1.3 Volume under z = x? + y? 
and over R = [—1, 1] x [0, 1]. 


These ideas are similar to those for a single integral Jf f(x) dx, which represents 
the area under the graph of f if f > 0; see Figure 5.1.4.! 

Single integrals la f(x) dx can be rigorously defined, without recourse to the 
area concept, as a limit of Riemann sums. The idea is to approximate f, | f(x)dx 
by choosing a partition a = xo < xı <--- < Xn = b of [a, b], selecting points c; € 
[x;, xi41], and forming the Riemann sum 


n-l b 
Y resa x) f fes 
i=0 @ 


! Readers not already familiar with this idea should review the appropriate sections of their introductory calculus text. 
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Figure 5.1.4 Area under the graph of a nonnegative 
continuous function f from x = a to x = bis L S(x)dx. 


b 


(see Figure 5.1.5). We examine the analogous process for double integrals in the next 
section. 


Figure 5.1.5 The sum of the areas of the 
y =f(x) Shaded rectangles is a Riemann sum, 
which approximates the area under f from 
x-—atox — b. 


4 7X9 €) Xj CQ X? Q Xy € X47 b 


Cavalieri's Principle 


There is a useful method for computing volumes, known as Cavalieri s principle. 
Suppose we have a solid body and we let A(x) denote its cross-sectional area in a 
plane P, measured at a distance x from a reference plane (Figure 5.1.6). 


A(x) = area of 
cross section 


Figure 5.1.6 A solid body with 
cross-sectional area A(x) at distance x 
from a reference plane. 


Reference plane 
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According to Cavalieri's principle, the volume of the body is given by 
b 
volume — il A(x) dx, 
a 


where a and 5 are the minimum and maximum distances from the reference plane. 
This can be made intuitively clear as follows. If we partition [a, b] into a = xo < 
Xi <--> <x, = b, then an approximating Riemann sum for the preceding inte- 
gral is 


n-l 
» A(ci)(xi41 — xi). 


i=0 


But this sum also approximates the volume of the body, because A(x) Ax is the vol- 
ume of a slab with cross-sectional area A(x) and thickness Ax (Figure 5.1.7). There- 
fore, it is reasonable to accept the preceding formula for the volume. A more 
careful justification of this method is given in the Internet supplement for Chap- 
ter 5. 


Figure 5.1.7 Volume of a slab with cross-sectional area A(x) and thickness 


A(x) 
Ax equals A(x)Ax. The total volume of the body is if A(x) dx. 


The Slice Method — Cavalieri’ s Principle Let S be a solid and, for x sat- 
isfying a < x < b, let P, be a family of parallel planes such that: 


l. S lies between P; and P; 
2. The area of the slice of S cut by P, is A(x). 


Then the volume of S is equal to 


1 A(x) dx. 
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Arx Bi m Note 


Bonaventura Cavalieri (1598-1647) was a pupil of Galileo and a professor in 
Bologna. His investigations into area and volume were important building 


blocks of the foundations of calculus. Although his methods were criticized by 
his contemporaries, similar ideas had been used by Archimedes in antiquity, 
and were later taken up by the “fathers” of calculus, Newton and Leibniz. 


Reduction to Iterated Integrals 


We now use Cavalieri’s principle to evaluate double integrals. Consider the solid 
region under a graph z = f(x, y) defined on the region [a, b] x [c, d], where f is 
continuous and greater than zero. There are two natural cross-sectional area functions: 
one obtained by using cutting planes perpendicular to the x axis, and the other obtained 
by using cutting planes perpendicular to the y axis. The cross section determined 
by a cutting plane x — xo, of the first sort, is the plane region under the graph of 
z= f'(xo, y) from y = c to y = d (Figure 5.1.8). 


d z=f(%y) 


Figure 5.1.8 Two different cross 
d sections sweeping out the volume under 
i” z= fey). 


When we fix x = xo, we obtain the function y > /(xo, y), whichis continuous on 
[c, d]. The cross-sectional area A(xo) is, therefore, equal to the integral E fŒ, y) dy. 
Thus, the cross-sectional area function A has domain [a, 5], and is given by the rule 
Aux b J a f(x, y) dy. By Cavalieri's principle, the volume V of the region under 
z — f(x, y) must be equal to 


v= ff aes - f | [ roo] 


The integral f, ja [ye 7 f(x, y) dy] dx is known as an iterated integral because it is 
obtained by integrating with respect to y and then integrating the result with respect 
to x. Because ffp f(x, y) d A is equal to the volume V, we get the following result. 
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[frena [Lf remo ]o- a) 


If we use cutting planes perpendicular to the y axis, we obtain 


f(x. y)dA = : ET dy. Q) 
IÍ, Lu 


The expression on the right of formula (2) is the iterated integral obtained by 
integrating with respect to x and then integrating the result with respect to y. 


Thus, if our intuition about volumes is correct, formulas (1) and (2) ought to be 
valid. This is in fact true when the concepts we are discussing are defined rigorously, 
and is known as Fubini 5 theorem. We give a proof of this theorem in the next section. 

As the following examples illustrate, the notion of the iterated integral and equa- 
tions (1) and (2) provide a powerful method for computing the double integral of a 
function of two variables. 


EXAMPLE 3| Evaluate the integral 


I (x? + y?) dx dy, 
R 
where R = [—1, 1] x [0, 1]. 


SOLUTION By equation (2), 


[|era f [s G^ eds a. 


To find fe (x? + y?) dx, we treat y as a constant and integrate with respect to x. 
Because x?/3 + y?x is an antiderivative of x? + y? with respect to x, we can integrate, 
using the fundamental theorem of calculus, to obtain 
1 x3 L 2 
il (+ y?)dx = [es] =% +2. 
Ld 3 Eg PAS 


Next, we integrate 2 + 2y? with respect to y from 0 to 1, to obtain 


[Gre 2 24 4 
lo pur ur pw x 7 3" Las 3 


Hence, the volume of the solid we saw in Figure 5.1.3 is 4/3. 
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For completeness, let us evaluate [f^ (x? + y?) dx dy using equation (1)—that 
is, integrating with respect to y first and then with respect to x. We have 


f[ eme -[ | f eee 


Treating x as a constant in the y integration, we obtain 


1 y] 1 
[ Q? 4 y)ady = 5t =x +>. 
0 3 y=0 3 


Next, we evaluate fe. (x? + 4) dx to obtain 


i 1 a? x] 4 
2 

mds] zat «€ 

[ (esse Ek 3 


which agrees with our previous answer. A 


DAAG Compute the double integral //, cos x sin y dx dy, where S is the 
square [0, 2/2] x [0, 2/2] (see Figure 5.1.9). 


z=cosxsiny 


Figure 5.1.9 Volume under z = cos x sin y 
and over the rectangle [0, 2/2] x [0, 2/2]. 


SOLUTION By equation (2), 


z/lp px/2 
If cosx sin y dx dy =f pi cosx sin ya o 
S 0 0 
z/2 z/2 z/2 
«f | f cosx dx [dy E sinydy —1. A 
0 0 0 
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In the next section, we shall use Riemann sums to rigorously define the double 
integral for a large class of functions of two variables without recourse to the notion 
of volume. Although we shall drop the requirement that f(x, y) > 0, equations (1) 
and (2) will remain valid. Therefore, the iterated integral will again provide the key to 
computing the double integral. In Section 5.3, we treat double integrals over regions 
more general than rectangles. 

Finally, we remark that it is common to delete the brackets in iterated integrals 
such as equations (1) and (2) and write 


f [ (eoo. in place of f L[ oe e]e 


and 
d fb dp pb 
f Í f(x, y)dx dy in place of Í fi es. as] dy. 
e a € a 
EXERCISES 
1. Evaluate the following iterated integrals: 
(a) f, fo hy - y?) dy dx © fy fo Ge) dy dx 
(b) ff^ fo(ycosx 4-2)dy dx (à) f°, fA (Cx logy) dy dx 


2. Evaluate the integrals in Exercise 1 by integrating with respect to x and then with respect 
to y. [The solution to part (b) only is in the Study Guide to this text.] 


3. Use Cavalieri's principle to show that the volumes of two cylinders with the same base 
and height are equal (see Figure 5.1.10). 


Figure 5.1.10 Two cylinders 
with the same base and height 
have the same volume. 


4. Using Cavalieri's principle, compute the volume of the structure shown in Figure 5.1.11; 
each cross section is a rectangle of length 5 and width 3. 
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Figure 5.1.11 Compute this volume. 


5. A lumberjack cuts out a wedge-shaped piece W of a cylindrical tree of radius r obtained 
by making two saw cuts to the tree's center, one horizontally and one at an angle 6. Compute 
the volume of the wedge W using Cavalieri's principle. (See Figure 5.1.12.) 


Figure 5.1.12 Find the volume of W. 


6. (a) Show that the volume of the solid of revolution shown in Figure 5.1.13(a) is 


b 
x f treoras. 


(b) Show that the volume of the region obtained by rotating the region under the graph 
of the parabola y = —x? + 2x +3, —1 < x < 3, about the x axis is 5127/15 [see Figure 
5.1.13(b)]. 


Figure 5.1.13 The solid of 
revolution (a) has volume 

x fA Lf GO. dx. Part (b) shows 
the region between the graph of 
y = —x? + 2x 4-3 and the x 
axis rotated about the x axis. 


(a) (b) 
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Evaluate the double integrals in Exercises 7 to 9, where R is the rectangle [0, 2] x [—1, 0]. 


* Il (x?y? + x)dy dx 
R 
1 
8. I (Ivl cos z7x) dy dx 
at 
9. ji) (7x€ sin 2) dy dx 


10. Find the volume bounded by the graph of f(x, y) = 1 + 2x + 3y, the rectangle 
[1, 2] x [0, 1], and the four vertical sides of the rectangle R, as in Figure 5.1.1. 


11. Repeat Exercise 10 for the function f(x, y) = x* + y? and the rectangle 
[-1, 1] x [-3, -2]. 


5.2 The Double Integral Over a Rectangle 


We are ready to give a rigorous definition of the double integral as the limit of a 
sequence of sums. This will then be used to define the volume of the region under 
the graph of a function f(x, y). We shall not require that f(x, y) > 0; but if f(x, y) 
assumes negative values, we shall interpret the integral as a signed volume, just as for 
the area under the graph of a function of one variable. In addition, we shall discuss 
some of the fundamental algebraic properties of the double integral and prove Fubini’s 
theorem, which states that the double integral can be calculated as an iterated integral. 
To begin, let us establish some notation for partitions and sums. 


Definition of the Integral 


Consider a closed rectangle R C R?; that is, R is a Cartesian product of two inter- 
vals: R = [a, b] x [c, d]. By a regular partition of R of order n we mean the two 
ordered collections of n + 1 equally spaced points (x;)5 y and {yx}{_9, that is, the 
points satisfying 


a—xQ9«xp«-:«x,—b, e= yj <y < y =d 
and 


b—a d—c 
Xj -Xj = > Yl — Vk = 


(see Figure 5.2.1). 

A function f(x, y) is said to be bounded if there is a number M > 0 such that 
—M < f(x, y) < M for all (x, y) in the domain of f. A continuous function on a 
closed rectangle is always bounded, but, for example, f(x, y) = 1/x on (0, 1] x [0, 1] 
is continuous but is not bounded, because 1/x becomes arbitrarily large for x near 0. 
The rectangle (0, 1] x [0, 1] is not closed, because the endpoint 0 is missing in the 
first factor. 
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Figure 5.2.1 A regular partition ofa 
rectangle R, with n = 4. 


a= X9 Xi X3 X3 x4=b 


Let Rjx be the rectangle [x;, x 41] x [vis Yk+1ı], and let cjg be any point in Rjg. 
Suppose f: R — R is a bounded real-valued function. Form the sum 


n=l n-l 
S, = Y f(ej) Ax Ay = Y fle) AA, 0) 
j k=0 jK—0 
where 
b—a d—c 
Ax —Xjua-xj $00 AY = Yk — Ye j 
n n 
and 
AA = Ax Ay. 


This sum is taken over all j’s and k’s from 0 to n — 1, and so there are n? terms. A 
sum of this type is called a Riemann sum for f. 


DEFINITION: Double Integral If the sequence (5,) converges to a limit 
S as n — oo and if the limit S is the same for any choice of points c;; in the 
rectangles R jx, then we say that f is integrable over R and we write 


n [| 1e 5o. or [[ ro 


for the limit S. 


Thus, we can rewrite integrability in the following way: 


n-l 
limi Ð Sew x av= ff rax» 


for any choice of cj, € Rjg. 
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Properties of the Integral 
The proof of the following basic theorem is presented in the Internet supplement for 
Chapter 5. 


THEOREM 1 Any continuous function defined on a closed rectangle R is 
integrable. 


If f(x, y) = 0, the existence of limit, 4; S, has a straightforward geometric 
meaning. Consider the graph of z — f(x, y) as the top of a solid whose base is the 
rectangle R. If we take each c;, to be a point where f(x, y) has its minimum value? 
on Rjg, then f (cj) Ax Ay represents the volume of a rectangular box with base R jx. 
The sum DO es f(ejx) Ax Ay equals the volume of an inscribed solid, part of which 
is shown in Figure 5.2.2. 


I(x) 
Figure 5.2.2 The sum of inscribed boxes 
approximates the volume under the graph of 


z= fœ, y). 


Similarly, if c;, is a point where f(x, y) has its maximum on R jx, then the sum 
Xs f (cj) Ax Ay is equal to the volume of a circumscribed solid (see Figure 
5.2.3). 

Therefore, if limit, oo S, exists and is independent of ¢;, € R jx, it follows that 
the volumes of the inscribed and circumscribed solids approach the same limit as 
n — oo. It is therefore reasonable to call this limit the exact volume of the solid 
under the graph of f. Thus, the method of Riemann sums supports the concepts 
introduced on an intuitive basis in Section 5.1. 

There is a theorem guaranteeing the existence of the integral of certain discon- 
tinuous functions as well. We shall need this result in the next section in order to 
discuss the integrals of functions over regions more general than rectangles. We shall 
be specifically interested in functions whose discontinuities lie on curves in the xy 
plane. Figure 5.2.4 shows two functions defined on a rectangle R whose discontinuities 


?Such c;, exist by virtue of the continuity of f on R; see Theorem 7 in Section 3.3. 
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Figure 5.2.3 The volume of circumscribed boxes 
also approximates the volume under z — f(x, y). 


A "broken surface” 


S ze) 4 = . miis 
—a NU i 


y 


Set of discontinuities of f x 
Set of discontinuities of f 
Figure 5.2.4 What the graphs of discontinuous functions of two variables might 
look like. 


lie along curves. In other words, f is continuous at each point that is in A, but not 
necessarily on the curve. 
Useful curves are graphs of functions suchas y = $(x), a < x € b,orx = W(y), 
c € y < d,orfinite unions of such graphs. Some examples are shown in Figure 5.2.5. 
The next theorem provides an important criterion for determining whether a 
function is integrable. The proof is discussed in the Internet supplement. 


THEOREM 2: Integrability of Bounded Functions Let f: R > R 
be a bounded real-valued function on the rectangle R, and suppose that the set 
of points where f is discontinuous lies on a finite union of graphs of continuous 
functions. Then f is integrable over R. 
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YA 


Y= 2%) 


e. 


HIR Md. 
/ 
f 
\ 
\ 


Figure 5.2.5 Curves in the plane represented as graphs. 


Using Theorem 2 and the remarks preceding it, we see that the functions sketched 
in Figure 5.2.4 are integrable over R, because these functions are bounded and con- 
tinuous except on graphs of continuous functions. 

From the definition of the integral as a limit of sums and the limit theorems, 
we can deduce some fundamental properties of the integral ff, n f (x, y) d 4; these 
properties are essentially the same as for the integral of a real-valued function of a 
single variable. 

Let f and g be integrable functions on the rectangle R, and let c be a constant. 
Then f 4- g and cf are integrable, and 


(i) Linearity 


[[ re e naa ff feda ff senda. 


(ii) Homogeneity 


If, cfs. yd =e ff 6.22. 


(iii) Monotonicity If f(x, y) > g(x, y), then 


[[, 1e aa 2 ff s aa. 


(iv) Additivity If R;,i = 1, ..., m, are pairwise disjoint rectangles such that f 
is bounded and integrable over each R; and if Q = Rj U R2U--- U Rm isa 
rectangle, then f: Q — R is integrable over Q and 


ff re - ZA fle, yd A. 
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Properties (i) and (ii) are a consequence of the definition of the integral as a limit 
of a sum and the following facts for convergent sequences (5,) and (7; ), which are 
proved as with the limit theorems in Chapter 2: 


limit (7, + S,) = limit 7, + limit S, 
100 noo noo 
limit (cS,) = c limit S,,. 
noo noo 
To demonstrate monotonicity, we first observe that if h(x, y) > 0 and {S,} is a 
sequence of Riemann sums that converges to S ff, pix, y) d A, then S, > 0 for all n, so 


that [fp h(x, y) d A = limit, oS, > 0. If f(x, y) = g(x, y) for all (x, y) € R, then 
(f — g)(x, y) = 0 for all (x, y), and, using properties (i) and (ii), we have 


If sada- ff a y)d4= ff L(x, y) — g(r, yd > 0. 


This proves property (iii). The proof of property (iv) is more technical and a special 
case is proved in the Internet supplement. It should be intuitively obvious. 
Another important result is the inequality 


|f] n 


To see why formula (2) is true, note that, by the definition of absolute value, 


-[f| & sfl 


therefore, from the monotonicity and homogeneity of integration (with c = —1), 


- ff nans ff seas ff naa, 


which is equivalent to formula (2). 


< II /1dA. O 


Fubini’s Theorem 


Although we have noted the integrability of a variety of functions, we have not yet 
established rigorously a general method of computing integrals. In the case of one 
variable, we avoid computing f, b f(x) dx from its definition as a limit of a sum by 
using the fundamental theorem of integral calculus. This important theorem tells us 
that if f is continuous, then 


b 
f fG)dx = FQ) — F(a), 


where F is an antiderivative of f ; that is, F' = f. 
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This technique will not work as stated for functions f(x, y) of two variables. 
However, as we indicate in Section 5.1, we can often reduce a double integral over a 
rectangle to iterated single integrals; the fundamental theorem then applies to each 
of these single integrals. Fubini's theorem, which was mentioned in the last section, 
establishes this reduction to iterated integrals rigorously, by using Riemann sums. As 
we saw in Section 5.1, the reduction, 


ECEE [ n Fes. y] dx = L nS ds 


is a consequence of Cavalieri’s principle, at least if f(x, y) > 0. In terms of Riemann 
sums, it corresponds to the following equality: 


n-l n—l fn-|l n—1 fn-l 
Y fej) Ax Ay = 37 (E Sen) J Ax 2» (Y Fle) as) Ay, 
j-0 \k=0 


j4—0 k=0 \j=0 


which may be proved more generally as follows: Let [ają] be an n x n matrix, 


where0 < j € n — V and0 € k € n — l. Let MT ajk be the sum of the n? matrix 


entries. Then 


n-l n—-l /n-1 n—l fn-l 
b» ajk = b» (E 2 = yy ( D : (3) 
k=0 


jk=0 j-0 4-0 


In the first equality, the right-hand side represents summing the matrix entries 
first by rows and then adding the results: 


PAL 
a00 agı 402 aok pe Q(n—1) rar 
n-1 
ajo aj ajk 700 Ajn- >» üjk 
k=0 
—————M n-l 
G(n—l10 — G(n-1) t Anke +t An-n) Yan 
k=0 


Clearly, this is equal to prm ajk, that is, the sum of all the a;,. Similarly, the 
sum XY DET a jk) represents a summing of the matrix entries by columns. This 
establishes equation (3) and makes the reduction to iterated integrals quite plausible 
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if we remember that integrals can be approximated by the corresponding Riemann 
sums. The actual proof of Fubini's theorem exploits this idea. 


THEOREM 3: Fubini's Theorem Let f bea continuous function with 
a rectangular domain R = [a, b] x [c, d]. Then 


b d d b 
| | tenarac= ff Tæ xay = ff fo. naa. 4) 
a È È a 
PROOF We shall first show that 


Í i fle, y) dydx = II f(x. yd. 


Let c = yo < yı <---> < y, = d be a partition of [c, d] into n equal parts. Define 


d 
F()= f flr, y)dy. 


Then 
n=l yia 
Fx)-3 | fe». 
k=0 "Y 


Using the integral version of the mean-value theorem,? for each fixed x and for each 
k we have 


Vee 
f foe, dy = fE, nDO y) 


Yk 


(see Figure 5.2.6), where the point Y;(x) belongs to [yx, yk+1ı] and may depend on 
x, k, and n. 
We have thus shown that 


n-l 
F(x) = Y fos Ye) — y). (5) 


k=0 


3 This states that if g(x) is continuous on [a, b], then Tu g(x) dx = g(c)(b — a) for some point c € [a, b]. The more general 
second mean-value theorem was proved in Section 3.2. 


U 
W 
un 
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z=fx,y) 


f M 
a 150 . d= Figure 5.2.6 The notation 
E ae zi Yat y va Ys Y6 i ME c E any needed in the proof of 
Fubini's theorem; n — 8. 


By the definition of the integral in one variable as a limit of Riemann sums, 


b b d n-l 
[ roe- [f | J fava] dx = limit Ý F(p,)(xj+1 — xj. 
a a c j=0 


where a = x9 < x} < ++- < x, = b is a partition of the interval [a, b] into n equal 
parts and p; is any point in [x;, x;41]. Setting cj; = (pj, Y4(p;)) € Rix, we have 
[substituting p; for x in equation (5)] 


n-l 


F(pj) = D> f(ej)Orea — ya). 
k=0 
Therefore, 


b pd n-l 
f [ re ome f Heide = s (Zj — xj) 


n—l n—l 


= limit ^3 ^ f(ej)Onea — ya — xj) 
j=0 k=0 


- ff f(x, y) dA. 
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Thus, we have proved that 


[ [ 1e »o- ff fy) dA. 


By the same reasoning we can show that 


f [ 1e» ff fæ 9) dA. 


These two conclusions are exactly what we wanted to prove. E 


Fubini's theorem can be generalized to the case where f is not necessarily contin- 
uous. Although we shall not present a proof, we state here this more general version. 


THEOREM 3’: Fubini's Theorem Let f be a bounded function with do- 
main a rectangle R = [a, b] x [c, d], and suppose the discontinuities of f lie on a 
finite union of graphs of continuous functions. If the integral ni f(x, y) dy exists for 


each x € [a, b], then 
bp pd 
f | f Pes. ay] ax 


exists and 


i "ebde f(x, y)dA. 
a Je R 


Similarly, if [^ f(x, y) dx exists for each y € [c, d], then 


f Lem] dy 


exists and 


[ [ 7e» ff foe, yd A. 


Thus, if all these conditions hold simultaneously, 


[ [ reme - f f roaa [f f(x. y)dA. 
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The assumptions made for this version of Fubini's theorem are more complicated 
than those we made in Theorem 3. They are necessary because if f is not continuous 


everywhere, for example, there is no guarantee that ff fx, y) dy will exist for each x. 


DSA Compute ff, (x? + y) dA, where R is the square [0, 1] x [0, 1]. 
SOLUTION By Fubini's theorem, 


[ferna f [eraa f || e oe Jo» 


By the fundamental theorem of calculus, the x integration may be performed: 


1 x3 1 1 
[eo dx =| e| =- +y. 
0 3 us. 3 


Thus, 


E 1 y] s 
2 = = mm Pe ae =- 
Hc +yda= f [; eo-[»* ds 


What we have done is hold y fixed, integrate with respect to x, and then evaluate the 
result between the given limits for the x variable. Next, we integrated the remaining 
function (of y alone) with respect to y to obtain the final answer. A 


HGAI A consequence of Fubini's theorem is that interchanging the order 
of integration in the iterated integrals does not change the answer. Verify this for 
Example 1. 


SOLUTION We carry out the integration in the other order: 


1 pl 1 y] 1 1 
i [ 6^ +y)dydx = | [y+ 2] aaf [+5] dx 
o Jo 0 2 jy-o 0 2 


[eT A 
L3 2], 6 


We have seen that when f(x,y)>0 on R= [a,b] x [c,d], the integral 
Ff rR JŒ, y) dA can be interpreted as a volume. If the function also takes on nega- 
tive values, then the double integral can be thought of as the sum of all volumes 
lying between the surface z = f(x, y) and the plane z = 0, bounded by the planes 
x =a,x = b, y = c, and y = d; here the volumes above z = 0 are counted as posi- 
tive and those below as negative. However, Fubini's theorem as stated remains valid in 
the case where f(x, y) is negative or changes sign on R; that is, there is no restriction 
on the sign of f in the hypotheses of the theorem. 
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GAT] Let R be the rectangle [—2, 1] x [0, 1] and let f be defined by 
fx, y) = yG? — 12x); f(x, y) takes on both positive and negative values on R. 
Evaluate the integral [fp f(x, y) dx dy = ff, y? — 12x) dx dy. 


SOLUTION By Fubini’s theorem, we may write 


5 ie ae 5! 57 
y(x — 12x)dx dy = y(x —12x)dx|dy = — | ydy=—. 
R o LJ-2 4 Jo 8 


Alternatively, integrating first with respect to y, we find 


Il yG? — I2x)dy dx = f. n = 234] dx 


N LU 1[x* 1 87 
as 12x = 3 . 
IR )dx F 6x L z ô 


E es A Note 


The Riemann Integral 


The first time most mathematics students encounter the name of Bernhard 
Riemann is in their calculus courses, where they read about the Riemann 
integral. Leibniz had thought of the integral of a function of one variable as 
an infinite sum (the f standing for a sum) of infinitesimal areas f(x) dx, 
where dx is an “infinitesimal width” and f(x) is the height of the 
corresponding “infinitesimally thin” rectangle. This intuitive approach 
sufficed for most purposes because the fundamental theorem 


b 
[ Fods = FO) - Fio) 


showed how to evaluate this (nebulously defined) integral when one knows 
the antiderivative F of f. 

However, Riemann was interested in applying integration to functions 
of one variable where the antiderivative was not known, and to functions in 
number theory or in general to those functions that “one need not find in 
nature.” 

Cauchy had already known that all continuous functions could be 
integrated and that the fundamental theorem was valid—that is, every 
continuous function had an antiderivative. However, his proofs were not 
entirely rigorous. For applications to number theory and to certain series 
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(called Fourier series), Riemann needed a clear, precise definition of the 
integral, which he presented in a paper in 1854. In this paper he defines his 
integral and gives necessary and sufficient conditions for a bounded function 
f to be integrable over an interval [a, b]. 

In 1876, the German mathematician Karl J. Thomae generalized 
Riemann'sintegral to apply to functions of several variables, as we do in this 
chapter. We further develop this approach in the Internet supplement. 

In the first half of the nineteenth century, Cauchy had observed that for 
continuous function of two variables, Fubini's theorem was valid. But 
Cauchy also gave an example of an unbounded function of two variables for 
which the iterated integrals were not equal. In 1878, Thomae gave the first 
example of a bounded function of two variables where one iterated integral 
exists and the other does not. In these examples, the functions were not 
“Riemann integrable” in the sense described in this section. Cauchy and 
Thomae’s examples demonstrated that one must apply caution and not 
necessarily assume that iterated integrals are always equal. 

In 1902, the French mathematician Henri Lebesgue developed a truly 
sweeping generalization of the Riemann integral. Lebesgue’s theory allowed 
integration of vastly more functions than did Riemann’s approach. Perhaps, 
unforeseen by Lebesgue, his theory was to have a profound impact on the 
development of many areas of mathematics in the twentieth century—in 
particular the theory of partial differential equations. Mathematics students 
go into more depth about the Lebesgue integral in their first year of graduate 
study. 

In 1907, the Italian mathematician Guido Fubini used the Lebesgue 
integral to state the most general form of the theorem on the equality of 
iterated integrals, the form that is studied today and used by working 


mathematicians and scientists in their research. 


EXERCISES 
1. Evaluate each of the following integrals if R = [0, 1] x [0, 1]. 


(a) Il @ 4 y?) dA (c) fI. (xy) cosx? d A 


(b) If, ye? dA (d) If In [(x + (y + 1] dA 


2. Evaluate each of the following integrals if R = [0, 1] x [0, 1]. 
(a) If (x" y") dx dy, where m,n > 0 (c) i) sin (x + y)dx dy 
R R 


(b) [| e 0a (d) [[ e mma 
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3. Compute the volume of the region over the rectangle [0, 1] x [0, 1] and under the graph 
ofz — xy. 


4. Compute the volume of the solid bounded by the xz plane, the yz plane, the xy plane, 
the planes x = 1 and y = 1, and the surface z = x? + y^. 


5. Let f be continuous on [a, b] and g continuous on [c, d]. Show that 


b d 
f [rosone - soa [| [ sow 


where R = [a, b] x [c,d ]. 


6. Compute the volume of the solid bounded by the surface z — sin y, the planes x — 1, 
x = 0, y = 0, and y = 7/2, and the xy plane. 


7. Compute the volume of the solid bounded by the graph z — x? 4- y, the rectangle 
= [0, 1] x [1, 2], and the “vertical sides" of R. 


8. Let f be continuous on R = [a, b] x [c, d]; fora « x < b,c « y « d, define 


F(x,y)- [ [ rema 


Show that 3? F/3x dy = 0? F/0y ax = f(x, y). Use this example to discuss the relationship 
between Fubini's theorem and the equality of mixed partial derivatives. 


9. Let f: [0, 1] x [0, 1] — R be defined by 


1 x rational 


2y x irrational. 


| 


Show that the iterated integral i [ i fox. y) dy] dx exists but that f is not integrable. 
10. Express ffp cosh xy dx dy as a convergent sequence, where R = [0, 1] x [0, 1]. 
11. Although Fubini’s theorem holds for most functions met in practice, one must still 


exercise some caution. This exercise gives a function for which it fails. By using a 
substitution involving the tangent function, show that 


2m HY yet 
[Lappe i LI CET S 


Why does this not contradict Theorem 3 or 3? 


12. Let f be continuous, f > 0, on the rectangle R. If ff, f dA =0, prove that f = 0 on R. 
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5.3 The Double Integral Over More General Regions 


Our goal in this section is twofold: First, we wish to define the double integral of a 
function f(x, y) over regions D more general than rectangles; second, we want to 
develop a technique for evaluating this type of integral. To accomplish this, we shall 
define three special types of subsets of the xy plane, and then extend the notion of 
the double integral to them. 


Elementary Regions 


Suppose we are given two continuous real-valued functions $i: [a,b] > R and 
2: [a,b] — R that satisfy ¢)(x) < ¢2(x) for all x € [a, b]. Let D be the set of 
all points (x, y) such that x € [a, b] and $1(x) < y < (x). This region D is said to 
be y-simple. Figure 5.3.1 shows various examples of y-simple regions. The curves 
and straight-line segments that bound the region together constitute the boundary 
of D, denoted dD. We use the phrase y-simple because the region is described in a 
relatively simple way, using y as a function of x. 


Y= 20%) 


Figure 5.3.1 Some y-simple regions. 


We say that a region D is x-simple if there are continuous functions y and y2 
defined on [c, d] such that D is the set of points (x, y) satisfying 


yele,d] and  yi(y)<x < yy) 


where yi(y) < vo(y) for all y € [c,d]. Again, the curves that bound the region 
D constitute its boundary 3D. Some examples of x-simple regions are shown in 
Figure 5.3.2. In this situation, x is the distinguished variable, given as a function of 
y. Thus, the phrase x-simple is appropriate. 

Finally, a simple region is one that is both x- and y-simple; that is, a simple region 
can be described as both an x-simple region and a y-simple region. An example of a 
simple region is a unit disk (see Figure 5.3.3). 

Sometimes we will refer to any of the regions as elementary regions. Note that 
the boundary 3D of an elementary region is the type of set of discontinuities of a 
function allowed in Theorem 2. 


Figure 5.3.2 Some x-simple regions. 


»-é)- TTE PRORA 


(a) (b) 


Figure 5.3.3 The unit disk, a simple region: (a) as a y-simple region, and (b) as an 
x-simple region. 


The Integral over an Elementary Region 


We can now use an interesting "trick" to extend the definition of the integral from 
rectangles to elementary regions. 


DEFINITION: Integral over an Elementary Region If D isan elementary 
region in the plane, choose a rectangle A that contains D. Given f: D > R, 
where f is continuous (and hence bounded), define ff, p J (x, y) dA, the integral 
of f over the set D as follows: Extend f to a function f* defined on all of 
R by 


d if (x, y)e D 
0 


foy) - if (x, y) d D and (x, y) € R. 


Note that f* is bounded (because f is) and continuous except possibly on the 
boundary of D (see Figure 5.3.4). The boundary of D consists of graphs of 
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continuous functions, and so f* is integrable over R by Theorem 2, Section 5.2. 
Therefore, we can define 


J| (e 247 ff ravaa 


When f(x, y) > 0 on D, we can interpret the integral ffp f(x, y) dA as the volume 
of the three-dimensional region between the graph of f and D, as is evident from 
Figure 5.3.4. 


Graph of z =f (x, y) Graph of /* 


i 


x Elementary region 
(a) (b) 
Figure 5.3.4 (a) Graph ofz = f(x, y) over an elementary region D. (b) Shaded 
region shows graph of z = f*(x, y) on some rectangle R containing D. From this 


picture we see that boundary points of D may be points of discontinuity of f*, 
because the graph of z = f*(x, y) can be broken at these points. 


We have defined ff, f(x, y) dx dy by choosing a rectangle R that encloses D. 
It should be intuitively clear that the value of ff^, f(x, y) dx dy does not depend on 
the particular R we select; we shall demonstrate this fact at the end of this section. 


Reduction to Iterated Integrals 


If R = [a, b] x [c, d] is a rectangle containing D, we can use the results on iterated 
integrals in Section 5.2 to obtain 


f(x. y)dA- fe, y)dA= ' " Feiss 
D R a c 


=f" [re odas. 


where f* equals f in D and zero outside D, as before. Assume that D is a 
y-simple region determined by functions $;:[a, b] > R and $»:[a, b] > R. 
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Consider the iterated integral 


[ [ re ooa. 


and, in particular, the inner integral Jt a f" (x, y)dy for some fixed x (Figure 5.3.5). 
By definition, f*(x, y) = 0 if y < $i(x) or y > $»(x), so we obtain 


d a(x) r(x) 
[rena [^ rear [ore nay 
c [7163] (x) 


Figure 5.3.5 The region between two graphs—a y-simple 
region. 


We summarize what we have obtained in the following. 


THEOREM 4: Reduction to Iterated Integrals If D is a y-simple 
region, as shown in Figure 5.3.5, then 


b phx) 
ff renar) [7 so.nayas. 
D a Jo(x) 


In the case f(x, y) = 1 for all (x, y) € D, ffp f(x. y) d A is the area of D. On 
the other hand, in this case, the right-hand side of formula (1) becomes: 


b d»(x) b 
f [fo aras = [ teo - eas = 40), 
a J$i(x) a 


which is the formula for the area of D learned in one-variable calculus. Thus, formula 
(1) checks in this case. 
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DOW s Find ff, Gy + cosx) dA, where T is the triangle consisting of 
all points (x, y) such that 0 <x < 7/2,0 < y € x. 


SOLUTION Referring to Figure 5.3.6 and formula (1), we have 


n/2 px 
ji (y + cosx) dA sÍ f (x3y + cosx) dy dx 
T 0 0 


z/2p,3,2 x z/l 75 
=f [+ eyes] a= f (5 + xcosx) ax 
0 2 y=0 0 2 
7/2 


x672 n/2 zÉ 
= | — +f (x cosx)dx = — —— + [x sinx + cos x]; 
12 Jo 0 (12)(64) 


Figure 5.3.6 A triangle T represented as a y-simple 
region. 


In the next example, we use formula (1) to find the volume of a solid whose base 
is a nonrectangular region D. 


70148974 Find the volume of the tetrahedron bounded by the planes 
y=0,z=0,x = 0, and y — x +z = 1 (Figure 5.3.7). 


SOLUTION We first note that the given tetrahedron has a triangular base D whose 
points (x, y) satisfy —1 < x < 0 and 0 < y < 1 + x; hence, D is a y-simple region. 
In fact, D is a simple region; see Figure 5.3.8. 

For any point (x, y) in D, the height of the surface z above (x, y) is 1 — y + x. 
Thus, the volume we seek is given by the integral 


Ifo —y-x)dA. 
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Figure 5.3.7 A tetrahedron bounded by the planes 
y =0,z=0,x =0,andy—x+z=1. 


Figure 5.3.8 The base of the tetrahedron in Figure 5.3.7 
represented as a y-simple region. 


go(x)=xt1 


(0, 0) 


(71,0) $10) = 0 


Using formula (1) with $;(x) = 0 and ¢2(x) = x + 1, we have 


0 plex 0 24i 
If a-y+»d4= | I (1 -»4yds- f o5 dx 
D -1 Jo E 2 Jy=0 
0 0 
feste - [eet ena 
sd 2 m 


6 6 
UNCTUS] Let D be a y-simple region. Describe its area A(D) as a limit of 


Riemann sums. 


SOLUTION If we recall the definition, A(D) = ff, dx dy is the integral over 
a containing rectangle R of the function f = 1. A Riemann sum S$, for this in- 
tegral is obtained by dividing A into subrectangles and forming the sum S, = 
bann f" (ej) Ax Ay, as in formula (1) of Section 5.2. Now f*(¢;x) is 1 or 0, de- 
pending on whether or not ¢;, is in D. Consider those subrectangles Rj, that have 
nonvoid intersection with D, and choose cj; in D N Rjg. Thus, S, is the sum of the 
areas of the subrectangles that meet D and A(D) is the limit of these as n > co. 
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Thus, A(D) is the limit of the areas of the rectangles “circumscribing” D. The reader 
should draw a figure to accompany this discussion. A 


The methods for treating x-simple regions are entirely analogous. Specifically, 
we have the following. 


THEOREM 4*: Iterated Integrals for x-Simple Regions Suppose 
that D is the set of points (x, y) such that y € [c, d] and yi(y) < x < Wo(y). If 
f is continuous on D, then 


f T fæ. 4 = [ i [ fe fasas] dy. o 


To find the area of D, we substitute f = 1 in formula (2); this yields 


Jf a-f “(a — Hoey. 


Again, this result for area agrees with the results of single-variable calculus for the 
area of a region between two curves. 

Either the method for y-simple or the method for x-simple regions can be used 
for integrals over simple regions. 

It follows from formulas (1) and (2) that ff, p J d A is independent of the choice 
of the rectangle R enclosing D used in the definition of Sf; p J dA, because, if we had 
picked another rectangle enclosing D, we would have arrived at the same formula (1). 


EXERCISES 


1. Evaluate the following iterated integrals and draw the regions D determined by the 
limits. State whether the regions are x-simple, y-simple, or simple. 


1 px? 1 pet 
(a) i, [ dy dx (c) [ f (x +y)dydx 
o Jo o Ji 
2 p3xtl 1 px? 
(b) f f dydx (d) [ [ ydydx 
1 J2x 0 Jx3 


2. Evaluate the following integrals and sketch the corresponding regions. 


2 py? 7/2 peosx 
(a) f f (x? + y) dx dy (d) [ f y sinx dydx 
—3 J0 0 0 


l pkl E 
(b) i) f e dy dx (e) [ f (x"+y")dxdy, m,n>0 
EN E o Jy 


2Ix| 


1 eü—2)72 0 p2ü—2)/2 
of I, dydx of f xdydx 
o Jo -1 Jo 
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3. Use double integrals to compute the area of a circle of radius r. 

4. Using double integrals, determine the area of an ellipse with semiaxes of length a and b. 

5. Whatis the volume of a barn that has a rectangular base 20 ft by 40 ft, vertical walls 30 ft 
high at the front (which we assume is on the 20-ft side of the barn), and 40 ft high at the rear? 


The barn has a flat roof. Use double integrals to compute the volume. 


6. Let D be the region bounded by the positive x and y axes and the line 3x + 4y = 10. 


Compute 
I G? +y*)dA. 
D 


7. Let D be the region bounded by the y axis and the parabola x = —4y? + 3. Compute 


Il xy dx dy. 
D 
1px 


8. Evaluate (x? + xy — y?) dy dx. Describe this iterated integral as an integral over 


2 


o Jo 
a certain region D in the xy plane. 


9. Let D be the region given as the set of (x, y) where 1 < x? + y? < 2 and y > 0. Is D an 
elementary region? Evaluate ff^, f(x, y) dA where f(x, y) = 1 4- xy. 


10. Use the formula A(D) = I dx dy to find the area enclosed by one period of the sine 
function sin x, for 0 < x < 2x, and the x axis. 


11. Find the volume of the region inside the surface z = x? + y? and between z = 0 and 
z — 10. 


12. Setup the integral required to calculate the volume of a cone of base radius r and 
height h. 


13. Evaluate ff, y d A where D is the set of points (x, y) such that 0 < 2x/z < y, y < sinx. 


b pd b d 
14. From Exercise 5, Section 52, f f S(x)g(y) dy dx = (f fdr) (f 2034). 


b d2(5) 
Is it true that ff, f(x)g(y) dx dy = (f f(x) dx) (f g(y) ay) for y-simple regions? 
a $i(a) 
15. Let D be a region given as the set of (x, y) with —-@(x) < y € ¢(x) anda <x <b, 
where ¢ is a nonnegative continuous function on the interval [a, b]. Let f(x, y) be a function 
on D such that f(x, y) = — f(x, —y) for all (x, y) € D. Argue that ff, f(x, y)dA = 0. 


16. Use the methods of this section to show that the area of the parallelogram D determined 
by two planar vectors a and b is |a;b2 — a»b;|, where a = a;i + aj and b = bii + bjj. 


17. Describe the area A(D) ofa region as a limit of areas of inscribed rectangles, as in 
Example 3. 
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5.4 Changing the Order of Integration 


Suppose that D is a simple region—that is, it is both x-simple and y-simple. Thus, it 
can be given as the set of points (x, y) such that 


axxsb, ġ@)<y <x), 
and also as the set of points (x, y) such that 
esysd, Wy) <x <p). 


Hence, we have the formulas 


b phx) d p(y) 
J| (e 24- [ se.navax= | [ f(x. y)dx dy. 


h(x) 1») 


If we are required to compute one of the preceding iterated integrals, we may 
do so by evaluating the other iterated integral; this technique is called changing the 
order of integration. It can be useful to make such a change when evaluating iterated 
integrals, because one of the iterated integrals may be more difficult to compute than 
the other. 


| IGH 95) By changing the order of integration, evaluate 


a p(a?—x2)"? 
[ i (a? — y?) ? dy dx. 
o Jo 


SOLUTION Note that x varies between 0 and a, and for each such fixed x, we have 
0 < y < (a? — x2)? Thus, the iterated integral is equivalent to the double integral 


I (a? — y?) ? dy dx, 
D 
where D is the set of points (x, y) suchthat0 < x < a and0 < y < (a? — x’)? But 


this is the representation of one quarter (the positive quadrant portion) of the disk of 
radius a; hence, D can also be described as the set of points (x, y) satisfying 


O<y<a, Ozxz(d-yy^? 


(see Figure 5.4.1). Thus, 


a p(a2—x2)"/2 à (qd yh) 
f [ P — yy ade = [ p (a — y3)! “l à 
o Jo o 


=f [x(a? — yy -y e 


344 2a 
-fe WE Toa 
0 0 
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(0, a) x= ai yl 


or 
ys d xl 


Figure 5.4.1 The positive-quadrant portion of a disk of 
radius a. 


(a, 0) 


We could have evaluated the initial iterated integral directly, but, as the reader can 
easily verify, changing the order of integration makes the problem simpler. The next 
example shows that it may not be obvious how to evaluate an iterated integral, and 


yet it may be relatively simple to evaluate the iterated integral obtained by changing 
the order of integration. 


EXAMPLE 2 


Evaluate 


2 plogx 
f i (x - DV14 e» dydx. 
1 Jo 


SOLUTION  Itwill simplify matters if we first interchange the order of integration. 


First notice that the integral is equal to [fp (x — 1)V1 + e?"d A, where D is the set 
of (x, y) such that 


1x x2 and 0 < y < logx. 
The region D is simple (see Figure 5.4.2) and can also be described by 


0 < y < log2 and esx <2, 


Figure 5.4.2 D is the region of integration for Example 2. 


Io 
un 
= 
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Thus, the given iterated integral is equal to 


log2 


log2 2 
d [ (x — DV 1-4- e dx dy 


log2 


vir | f'e Das as 
"ives x] dy 


log2 / o2 
-— iy ( 5 e) /1 l ey dy 
0 


1 kee, 


log2 
im 1 taray [7 e 14 e» dy. (1) 


In the first integral in expression (1), we substitute u = e? and in the second, v = e". 
Hence, we obtain 


174 2 
zi Vitudu+ [ V1 v? dv. (2) 
1 1 


Both integrals in expression (2) are easily found with techniques of one-variable 
calculus (or by consulting the table of integrals at the back of the book). For the first 
integral, we get 


[ j^ 1 e i 1 
id VI +udu = E + w^) = gg 4)? — 23/7) = qi” — 23/7), (3) 
1 


The second integral is 


l [evi E + og VIF +) 
= 5 [5 + toe (v5 +3] - 5 [v2 + 10g (v2 +] (4) 


2 
n Vl+vdv= 
I 


"3 


(see formula 43 in the table of integrals at the back of the book). Finally, we subtract 
equation (3) from equation (4) to obtain the answer 


4542 l2. 53/2 
G5 424 tog 377) z5 271 A 
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Mean Value Inequality 


We conclude with an inequality that helps us estimate integrals. Suppose there are 
numbers m and M such that forall (x, y) € D,andm < f(x, y) € M,thenintegrating 
over D, we get 


m+ A(D) < Í T fc, y)dA < M- A(D), () 


where A(D) is the area of the region D. Even though this inequality is obvious, it can 
help us estimate integrals that we cannot easily evaluate exactly. 


Consider the integral 


dx dy, 


1 
ll ees 


where D is the unit square [0, 1] x [0, 1]. Because the integrand satisfies, for x and 
y between 0 and 1, 


1 1 
=, 2 = — gli 
437 VI x6 yt 


and because the square has area 1, we get: 


1 1 

— —— dx d l. A 
ol) crm "is 

Mean Value Equality 


The mean value inequality can be turned into an equality when f is continuous. Here 
is the formal statement. 


THEOREM 5: Mean Value Theorem: Double Integrals Suppose 
f: D — R is continuous and D is an elementary region. Then for some point 
(Xo. yo) in D we have 


Í " fle. y)dA = f Gro. WALD), 


where A(D) denotes the area of D. 


PROOF We cannot prove this theorem with complete rigor, because it requires 
some concepts about continuous functions not proved in this course; but we can 
sketch the main ideas that underlie the proof. 

Because f is continuous on D, it has a maximum value M and a minimum 
value m. Thus, m < f(x, y) € M for all (x, y) € D. Furthermore, f(xi, yi) =m 
and f(x», y2) = M for some pairs (xi, yı) and (x2, y2) in D. 
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Dividing through inequality (5) by A(D), we get 


1 
ns xm fff ^e ene © 


Because a continuous function on D takes on every value between its max- 
imum and minimum values (this is the two-variable intermediate value theorem 
proved in advanced calculus; see also Review Exercise 32), and because the number 
[1/A(D)] [fp f x, y) dA is, by inequality (6), between these values, there must be a 
point (xo, yo) € D with 


Sæm) = T, for, y)d A, 


3n 


which is precisely the conclusion of Theorem 5. m 


EXERCISES 


1. In the following integrals, change the order of integration, sketch the corresponding 
regions, and evaluate the integral both ways. 


1 pl 
(a) [ Í xy dy dx 
A cosó 
(b) f f cos @ dr dO 
o Jo 
| p2-y 
(c) f f (x t yy dx dy 
o Ji 


b py 
(d) [ f f(x, y)dx dy (express your answer in terms of antiderivatives). 
a Ja 


2. Find 
1 pl 4 pog 
e f (x + yy dx dy (c) j f e dxdy 
-1 syl 0 Jy 
VO -y2) 1 px/4 " 
b f Í x! dx d. d f f sec’ x) dx d. 
(b) LaL y (d) ) a. )dx dy 


3. If f(x, y) =e" and D = [—x, x] x [77., 71], show that 


4. Show that 


sinx 
goes « ff dxdy <1 
{0.1}xqo.1) | + IE 


5. If D = [-1, 1] x [-1, 2], show that 


is ff dx dy PT 
dpe ye 17 
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6. Using the mean value inequality, show that 
1 i] i] dA 1 
= ears a 
67 JJp y—xX+3~ 4 
where D is the triangle with vertices (0, 0), (1, 1), and (1, 0). 


7. Compute the volume of an ellipsoid with semiaxes a, b, and c. (HINT: Use symmetry and 
first find the volume of one half of the ellipsoid.) 


8. Compute I f(x, y) d A, where f(x, y) = y? /x and D is the set of (x, y) where x > 0, 
D 
y » x?, and y < 10 — x?. 


9. Find the volume of the region determined by x? + y? + z? < 10, z > 2. Use the disk 
method from one-variable calculus and state how the method is related to Cavalieri's principle. 


10. Evaluate jj e*~ dx dy, where D is the interior of the triangle with vertices (0, 0), 
(1,3), and Q, 2). 

11. Evaluate f | y + y?y?? dx dy, where D is the region determined by the conditions 
— af sl 

12. Given that the double integral If, f(x, y) dx dy of a positive continuous function f 
equals the iterated integral [ | i) : f(x, y) av] dx, sketch the region D and interchange the 


order of integration. 


13. Given that the double integral If f(x, y) dx dy of a positive continuous function f 
D 
i fa-à 


b 
interchange the order of integration. 


b pb b 2 
14. Prove tatz f [ F(x) f(y») dy dx = (f sœ ax) . [Hinr: Notice that 


b 2 
d. = dx dy. 
( [ fe) 3 I Í span {OLDE 


15. Show that (see Exercise 27, Section 2.5) 


4 f [ tevais f rosas [ [mma 


1 
equals the iterated integral f | S(x,y) ^| dy, sketch the region D and 
0 


5.5 The Triple Integral 


Triple integrals are needed for many physical problems. For example, if the temper- 
ature inside an oven is not uniform, determining the average temperature involves 
“summing” the values of the temperature function at all points in the solid region 
enclosed by the oven walls and then dividing the answer by the total volume of the 
oven. Such a sum is expressed mathematically as a triple integral. 


un 
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Definition of the Triple Integral 


Our objective now is to define the triple integral of a function f(x, y, z) over a box 
(rectangular parallelepiped) B = [a, b] x [c,d] x [p,q]. Proceeding as in double 
integrals, we partition the three sides of B into n equal parts and form the sum 


n-l n—1 n-1 


Sr = YY figo ^F, 


i=0 j=0 k=0 


where ¢;;, is a point in Bijz, the ijkth rectangular parallelepiped (or box) in the 
partition of B, and AV is the volume of B;;, (see Figure 5.5.1). 


Figure 5.5.1 A partition of a box B into n? 
subboxes Bjj,. 


DEFINITION: Triple Integrals Let f be a bounded function of three vari- 
ables defined on B. If limit. ,45 Sn = S exists and is independent of any choice 
of c;;,, we call f integrable and call S the triple integral (or simply the integral) 


of f over B and denote it by 


f dV, fx, y,z)dV or fx, y, z) dx dy dz. 
B B B 


Properties of Triple Integrals 


As before, one can prove that continuous functions defined on B are integrable. More- 
over, bounded functions whose discontinuities are confined to graphs of continuous 
functions [such as x = (y, z), y = B(x, z), orz = y(x, y)] are integrable. The other 
basic properties (such as the fact that the integral of a sum is the sum of the integrals) 
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for double integrals also hold for triple integrals. Especially important is the reduction 
to iterated integrals: 


Reduction to Iterated Integrals Let f(x, y, z) be integrable on the box 8 — 
[a, b] x [c, d] x [p, q]. Then any iterated integral that exists is equal to the triple 
integral; that is, 


f[[ reo astris f [P [roo tetra: 
-f [ [12922 
-SF [ feo mavacas, 

a Jp Jc 


and so on. (There are six possible orders altogether.) 


DEWI (a) Let B be the box [0, 1] x [—5,0] x [0, 1]. Evaluate 
if (x + 2y + 3z} dx dy dz. 
B 


(b) Verify that we get the same answer if the integration is done in the order y first, 
then z, and then x. 


SOLUTION (a) According to the principle of reduction to iterated integrals, this 
integral may be evaluated as 


1/3 p0 1 
f f [ &*2 9r anys 
0 -12 40 


M3 pO 3| 
= f f CEREO dade 
o Jaz 3 bao 


1/3 
=Í E. 3[C +2 32 - Qy 327 ] av az 


1/3 " à 0 
= 1+2y+3z 2y + 3z d: 
i zl y + 3z)* — Qy DM z 


-1/2 
sf 3z + 1)* — 2(3z)4 + Bz — 1* | d 
=f [6+ »* -262* +. 6z- 4] az 


EE isle + 1) — 282) + Gz — yl" 


Lb qus. => 
24. T? 2 
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(b) 
ll (x + 2y + 3z} dy dz dx 
B 


1 pl/3 pO 
«Ll J (x +2y + 3z} dy dz dx 
0 0 -1/2 
rr ( +2y +32) |? dzdx 
0 Jo 6 yes 1/2 
1 Pl 4 
=i | s[e +32) - 6 32 - vy ] az 
o Jo 6 
[ 1f] 325. œ+3z-1% [I]? " 
-— x 
o 6 12 12 T 
1 
1 4 | 4 4 
=Í =a [@ +1) E(x— 0 2x*]ax 
NI - s ost! 1 
slot n= Les la p ^ 


|EX SBS) Integrate e'^?** over the box [0, 1] x [0, 1] x [0, 1]. 
SOLUTION We perform the integrations in the standard order: 


1 pl pl 1 pl 
[ [ f et? dx dy dz s [ (e **|1 o dy dz 
o Jo Jo o Jo 


I pl 1 
=Í f e*-eo«- f [e - or] dz 
0 J0 0 y-0 
1 1 
=| [ee 2e 4 ela: [e 22** ee]. 
0 


-eg—-3e843e—-1-2(e—1y. A 


As in the two-variable case, we define the integral of a function f over a bounded 
region W by defining a new function f*, equal to f on W and zero outside W, and 
then setting 


[J| 0o.» arte = [ff ro mas aras, 


where B is any box containing the region W. 


Elementary Regions 


As before, we restrict our attention to particularly simple regions. An elementary re- 
gion in three-dimensional space is one defined by restricting one ofthe variables to be 
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between two functions of the remaining variables, the domains of these functions be- 
ing an elementary (i.e., an x-simple or a y-simple) region in the plane. For example, if 
D is an elementary region in the xy plane and if yı (x, y) and y2(x, y) are two functions 
with y2(x, y) > yı(x, y), an elementary region consists of all (x, y, z) such that (x, y) 
lies in D and y\(x, y) < z € y»(x, y). Figure 5.5.2 shows two elementary regions. 


z 
z= y(x, y) | z= yy y) 


z= y») 


Figure 5.5.2 Two elementary regions in space. The domain D in the figure on the 
left is y-simple, while on the right it is x-simple. 


919095) Describe the unit ball x? + y? + 2? < 1 as an elementary region. 
SOLUTION This can be done in several ways. One, in which D is y-simple, is: 


In doing this, we first write the top and bottom hemispheres as z = y1 — x? — y? 
andz = —y l — x? — y?, respectively, where x and y vary over the unit disk (that is, 
=v 1 — x? € y € V1 — x? and x varies between —1 and 1). (See Figure 5.5.3.) We 
can describe the region in other ways by interchanging the roles of x, y, and z in the 
defining inequalities. A 


z 


z=y,(x,y)=41-x7-y7 


y=$¢,@)=-J1-# y=) =V1—x2 Figure 5.5.3 The unit ball as an 


y elementary region in space. 


zzy (9) =—\1— x2 -y7 
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Integrals over Elementary Regions 


As with integrals in the plane, any function of three variables that is continuous over 
an elementary region is integrable on that region. An argument like that for double 
integrals shows that a triple integral over an elementary region can be rewritten as 
an iterated integral in which the limits of integration are functions. The formulas for 
such iterated integrals are given in the following box. 


Triple Integrals by Iterated Integration Suppose that W is an elementary 
region described by bounding z between two functions of x and y. Then either 


pax) pry) 
Ii Fees »zdxdydz= f [ i f(x, y, z)dzdy dx 
pa) riy) 


[see Figure 5.5.2 (left)] or 


d Way) prey) 
Il fG y, 2d dydz = Í f j f(x. y. z) dz dx dy 
Ww c Jyly) Syny) 


[see Figure 5.5.2 (right)]. 


If f = 1, we get the integral fff, dx dy dz, which is the volume of the region W. 


Verify the volume formula for the ball of radius 1: 


4 
I dxdydz = =n, 
w 3 


where W is the set of (x, y, z) with x? + y? +2? < 1. 


SOLUTION We use the description of the unit ball from Example 3. From the 
first formula in the preceding box, the integral is 


1 Nimm 
f {3 dz dy dx. 
=a/ 1=x?=y? 


Holding y and x fixed and integrating with respect to z yields 


LEBER Pepe 


Because x is fixed in the y-integral, it can be expressed as /^ (a? — y?) ? dy, 
where a = (1 — x*)'/?. This integral is the area of a semicircular region of radius 
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a, so that 


a al 
(a2 — 2) dy = on. 


—a 


(We could also have used a trigonometric substitution or a table of integrals.) Thus, 


a -2 -y)dy 
-VT 2 


T, 


| ud 1—x2 


and so 


pop Ix: i jag 
af f (1-2? =)! dydx =2 f - dx 
-1 Ji? id 2 
i x3 1 4 
sef (1 xs on (s- 3) 
-1 


==". å 
3 
Other types of elementary regions are shown in Figure 5.5.4. For instance, in the 
second region, (y, z) lies in an elementary region in the yz plane and x lies between 
two graphs: 


x--l 


pi. Z) € x € p(y, z). 


ja. Ja. ja. 


l f f 
Top and bottom are Front and rear are Left and right are 
surfaces z —y (x, y) surfaces x = p(y, z) surfaces y — ó(x, z) 


Figure 5.5.4 Types of elementary regions in space. 


As shown in Figure 5.5.5, some elementary regions can be simultaneously described 

in all three ways. We shall call these regions symmetric elementary regions. 
Corresponding to each description of a region as an elementary region is an 

integration formula. For instance, if W is expressed as the set of all (x, y, z) such that 


cexyxd, wWiy)zzzwW»».  P(y,z) <x < p(y:z), 
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Figure 5.5.5 A symmetric 
elementary region can be 
described in three overall ways. 


eo o 


then 


d p(y) pez) 
ll fer y dx dydz = [| f f f(x, y, z) dx dz dy. 
Ww c JyaQ) “pi(y,2) 


Daw N55) Let W be the region bounded by the planes x = 0, y = 0, and 
z = 2, and the surface z = x? + y? and lying in the quadrant x > 0, y > 0. Compute 


Hi x dx dy dz and sketch the region. 
Ww 


SOLUTION Method 1. The region W is sketched in Figure 5.5.6. As indicated 
in the figure, we may describe this region by the inequalities 


O<x<v2, 0<y<V2—-x?,) x+y? <2 <2. 


Therefore, 


v2 J2-x? 2 
"i xdxdydz = Í i (f xdz) dy dx 
Ww 0 0 x+y? 
V2 eJ 2—x? 
- | j x(2—x? — y*)dy dx 

o Jo 
42 _ y23/2 

f fe xP e ; ) IE 
0 


42 245/2 
=f =e 2) dx = —2(2 = y 


v2 


0 


25⁄2 gy2 
15 15" 
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Figure 5.5.6 W is the region below the plane z — 2, 
above the paraboloid z = x? + y?, and on the positive 
sides of the planes x — 0, y — 0. 


Method 2. We can also place limits on x first and describe W by 0 € x < 
(z — y?)!? and (y, z) in D, where D is the subset of the yz plane with 0 < z < 2 
and 0 < y < z!/? (see Figure 5.5.7). 


Figure 5.5.7 A different description of the region in 
Example 5. 


Therefore, 


E=? 
Ii xdxdyde = [f (f de) dydz 
w D \Jo 
2 22 E-z)" 
= [i (f xdx) dy| az 
o LJo 0 
2 zi 2 
z—y 
= dy dz 
LL (=e 
2 3/2 2 
al P Na al 2 32a; 
2 Jo 3 2 Jy 3 


2 
2 sn 2 sn. 82 
v. 15 15 " 


which agrees with our previous answer. A 
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S328 Evaluate 


1 px 2 
[ hi [ dzdydx. 
0 Jo Jy 


Sketch the region W of integration and interpret. 
SOLUTION 


1 x 2 1 x 
ITI dzdydx= | f @-x-y)dydx 
0 JO Jx?+y? 0 J0 
! x? 1 1 2 
- 2 3 dx=1-- LI. 
[Ge x J * 4 1^3 


This integral is the volume of the region sketched in Figure 5.5.8. A 


Figure 5.5.8 The region W lies between the 
paraboloid z = x? + y? and the plane z = 2, 
and above the region D. 


EXERCISES 


In Exercises 1 to 4, perform the indicated integration over the given box. 


1 ff[ oiii st xi meten 
ü J[[ 7 o =0, 1] x [0,1] x [0, 1] 
> f[[2 «aeree B = [0,2] x [-1, 1] x [0, 1] 
f[[ imm = (0, 1] x [0.1] x [0, 1] 


In Exercises 5 to 8, describe the given region as an elementary region. 


5. The region between the cone z = x? + y? and the paraboloid z = x? + 5?. 
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6. The region cut out of the ball x? + y? + z? < 4 by the elliptic cylinder 2x? + z? = 1, 
that is, the region inside the cylinder and the ball. 


7. The region inside the sphere x? + y? + z? = 1 and above the plane z = 0. 


8. The region bounded by the planes x = 0, y 20,2 = 0, x + y = 4, and x =z —y— 1. 


Find the volume of the region in Exercises 9 to 12. 
9. The region bounded by z = x? + y? and z = 10 — x? — 2y?. 
10. The solid bounded by x? + 2y? = 2, z = 0, and x + y +2z = 2. 
11. The solid bounded by x = y, z = 0, y =0,x = l, and x + y +z = 0. 


12. The region common to the intersecting cylinders x? + y? < a? and x? 4- z? <a’. 


Evaluate the integrals in Exercises 13 to 21. 


1 2 3 
13. f j. f cos [zr(x + y + z)]dx dy dz 
0 1 2 


1px py 
. Í f f (y 4- xz) dz dy dx 
o Jo Jo 


15. Ii (x? + y? + 2°) dx dy dz; W is the region bounded by x + y +z = a (where 
w 
a> 0j x =0,y —0,andz = 0. 


16. Hi z dx dy dz; W is the region bounded by the planes x = 0, y = 0,z = 0,z = 1, 
Ww 
and the cylinder x? + y? = 1, with x > 0, y > 0. 


17. I x? cosz dx dy dz; W is the region bounded by z = 0, z = zr, y = 0, 
w 
y=1,x=0,andx+y=1. 


2 px pty 
18. f l | dz dy dx 
o Jo Jo 


19. "i (1 — z?) dx dy dz; W is the pyramid with top vertex at (0, 0, 1) and base vertices 
w 
at (0, 0, 0), (1, 0, 0), (0, 1, 0), and (1, 1, 0). 


20. I E y)dx dy dz; W is the same pyramid as in Exercise 19. 
w 


1 pdx px+y 
21. [ f [ dz dy dx. 
0 Jo xy? 


22. (a) Sketch the region for the integral a [ f(x, y. z) dz dy dx. 
(b) Write the integral with the dutégration order dx dy dz. 
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For the regions in Exercises 23 to 26, find the appropriate limits p(x), $»(x), ya(x, y), and 
y2(x, y), and write the triple integral over the region W as an iterated integral in the form 


fff i" -[ US LES esos 


23. W = (Gy, z) | Vx? +y? <z <1} 


24. W = {(x,y,z)| <z < l andx? +y? oz <1} 


1 
25. W = (x, yz) | x^ +y? < 1,z > Oand x? + y? 42? « 4] 

26. W = (x, yz) | lx] < Lb |y] < 1,z > Oandx? +y? +2? < 1} 

27. Show that the formula using triple integrals for the volume under the graph of a positive 
function f(x, y), on an elementary region D in the plane, reduces to the double integral of f 


over D. 


28. Let W be the region bounded by the planes x = 0, y = 0, z = 0, x + y = 1, and 
z=x +y. 


(a) Find the volume of W. 
(b) Evalute fff, x dx dy dz. 
(c) Evalute fff, y dx dy dz. 


29. Let f be continuous and let B, be the ball of radius £ centered at the point (xo, yo, zo). 
Let vol (B,) be the volume of B,. Prove that 


S > 
im ap ATT, f(x, y, z) dV = f(xo, Yo, zo). 


REVIEW EXERCISES FOR CHAPTER 5 


Evaluate the integrals in Exercises 1 to 4. 


3 n 1 pet 
1. f f xydydx 3. [ [ xInydydx 
0 J—x241 0 Je 


lel 1 p2 73 
2 J f (x + yy dy dx 4. [ f [ cos [zt (x + y + z)] dx dy dz. 
0 dye o Ji J2 


Reverse the order of integration of the integrals in Exercises 5 to 8 and evaluate. 
5. The integral in Exercise 1. 
6. The integral in Exercise 2. 
7. The integral in Exercise 3. 


8. The integral in Exercise 4. 
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9. Evaluate the integral f} fo f (y + xz)dz dy dx. 
10. Evaluate ja p e? dx dy. 
11. Evaluate f) [E y cos xy dx dy. 


12. Change the order of integration and evaluate 


2 pl 
[ (x + yy dx dy. 
0 Jy 


13. Show that evaluating ff, dx dy, where D is a y-simple region, reproduces the formula 
from one-variable calculus for the area between two curves. 


14. Change the order of integration and evaluate 
1 1 
f (x? + yx) dx dy. 
o Jyu2 


15. Let D be the region in the xy plane inside the unit circle x? 4- y? — 1. Evaluate 
Ip f x, y) dx dy in each of the following cases: 


(a) fœ y) — xy (b) fœ, y) 2 xy © f@, y) =y 


16. Find If» — cos (zx /4)] dx dy, where D is the region in Figure 5.R.1. 


pave 
Figure 5.R.1 The region of integration for Exercise 16. 


x 


Evaluate the integrals in Exercises 17 to 24. Sketch and identify the type of the region 
(corresponding to the way the integral is written). 


3 sinx 
17. n x(l + y)dy dx 
si 


nx 


x cos (1x /2) 
18. [ f. (x? +xy + l)dydx 
Q-y»? 
19. f. [ G J - 2y) dxdy 
2/3 
3(4/4-x2)/2 5 
f f (X + ») dydx 
sen N24 x 
1 x? 
21. [ y (x? + xy — y?) dy dx 
0 J0 


4 py 
22. f f 3dx dy 
2 y2—] 


20. 


> 
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23. [ I, (x+y) dy dx 
1 
24. LII P dx dy 
o Jo 
In Exercises 25 to 27, integrate the given function f over the given region D. 
25. f(x, y) 2x — y; D is the triangle with vertices (0, 0), (1, 0), and (2, 1). 
26. f(x, y) 5 x?y + cosx; D is the triangle defined by 0 € x < 1/2,0 € y < x. 


27. f(x, y) =x? + 2xy? + 2; D is the region bounded by the graph of y = —x? + x, the x 
axis, and the lines x — 0 and x — 2. 


In Exercises 28 and 29, sketch the region of integration, interchange the order, and evaluate. 
4 pve 
28. Í f (x? + y?) dy dx 
1 JI 
1 
29. f (x 4- y!) dx dy 
Q dy 


30. Show that 
4é < Í fi edd < 40%, 
[1.3]x[2.4] 


4r < i (x? +y? + D) dx dy < 207, 
D 


31. Show that 


where D is the disk of radius 2 centered at the origin. 

32. Suppose W is a path-connected region, that is, given any two points of W there is a 
continuous path joining them. If f is a continuous function on W, use the intermediate-value 
theorem to show that there is at least one point in W at which the value of f is equal to the 


average of f over W, that is, the integral of f over W divided by the volume of W. (Compare 
this with the mean-value theorem for double integrals.) What happens if W is not connected? 


33. Prove: Cu F(u)du]dt = hE — u)F(u) du. 


Evaluate the integrals in Exercises 34 to 36. 


1 pz py 

34. [ j [ xyz} dx dy dz 

35. Fn xg eae dy 

wef ff yz? dx dy dz 
1.41. Jury 


37. Write the iterated integral i T. ri f(x, y, z) dz dy dx as an integral over a region in 
R? and then rewrite it in five other possible orders of integration. 


Formula and Applications 
of Integration 
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If you are stuck in a calculus problem and don't know what else to do, try 
integrating by parts or changing variables. 


Jerry Kazdan 


If that fails, go away, have a cup of coffee, and think! 
Ute Müller 


he change of variables formula is one of the most powerful integration methods 
in single-variable calculus; it enables us to evaluate integrals such as 


1 
i xe" dx 
0 


by using the substitution, or change of variables u — x?, which reduces the problem 
to the easy task of integrating e" with respect to u. In this chapter, we develop the 
multidimensional change of variables formula, which is especially important and 
useful in evaluating multiple integrals in polar, cylindrical, and spherical coordinates. 

One of the key ingredients in the change of variables formula is how to change 
variables in multidimensions. This involves the notion of mapping, which occurs 
in various interesting situations. For example, consider a deforming object, such 
as a swimming fish. As it changes its shape, one can imagine the instantaneous 
correspondence between points on the fish in its rest state and in its current shape. 
This type of correspondence is, in fact, the main idea behind a change of variables, 
in this case, of one three-dimensional region (the fish in its rest state) to another (the 
fish in its current shape). 
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The first section in this chapter describes the key concepts for mappings between 
regions ofthe plane. It goes on to develop the change of variables technique for double 
and then triple integrals. The chapter also includes some of the important physical 
applications of the integral. 


6.1 The Geometry of Maps from R? to R? 


In this section, we shall be interested in maps from subsets of IR? to IR. The resulting 
geometric understanding will be useful in the next section, when we discuss the change 
of variables formula for multiple integrals. 


Maps of One Region to Another 


Let D* be a subset of IR?; suppose we consider a continuously differentiable map 
T: D* — R?, so T takes points in D* to points in R?. We denote the set of image 
points by D or by T(D*); hence, D = T(D*) is the set of all points (x, y) € R? such 
that 


(x, y) = T(x*, y*) for some (x*, y*) e D*. 


One way to understand the geometry ofa map T is to see how it deforms or changes 
D*. For example, Figure 6.1.1 illustrates a map T that takes a slightly twisted region 
into a disk. 


Figure 6.1.1 A function T from a region D* to a disk D. 


Let D* C R? be the rectangle D* = [0, 1] x [0, 27]. Then all 
points in D* are of the form (r, 0), where 0 <r < 1,0 <0 x 2m. Let T be the 
polar coordinate “change of variables" defined by T(r, 0) = (r cos0, r sin). Find 
the image set D. 


SOLUTION Let (x, y) = (r cos0, r sin). Because of the identity x? + y? = 
r? cos? 0 +r? sin? 0 =r < 1, we see that the set of points (x, y) € R? such that 
(x, y) € D has the property that x? + y? < 1, and so D is contained in the unit disk. 
In addition, any point (x, y) in the unit disk can be written as (r cos 0, r sin 0) for 
some 0 <r < l and 0 < 0 < 2x. Thus, D is the unit disk (see Figure 6.1.2). A 
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T: (r, 69) — (r cos 6, r sin 8) = (x, y) 


P ma 


Figure6.1.2 T gives a change of variables between Euclidean and polar 
coordinates. The unit circle is the image of a rectangle. 


AGN RRA Let T be defined by T(x, y) = ((x + y)/2, (x — y)/2) and let 
D* = [-1, 1] x [-1, 1] C R? bea square with side of length 2 centered at the origin. 
Determine the image D obtained by applying 7 to D*. 


SOLUTION  Letus first determine the effect of T on the line c; (t) = (t, 1), where 
—l1 €t <1 (see Figure 6.1.3). We have T(e(t)) = ((t + 1)/2, (t — 1)/2). The map 
t œ> T(ei(t)) isa parametrization of the line y = x — 1, 0 € x < 1, because (t — 1)/ 
2 = (t + 1)/2 — 1. This is the straight line segment joining (1, 0) and (0, — 1). 


C1, 1) (1, 1) 
Figure 6.1.3 Domain for the transformation T of 


C4 €; Example 2. 


C1 -1) (,-1) 


Let 
ext) = (1, t), EE 
ex(t) = (t, —1), -l<t<l 
c(t) 2(-1,), —-lst<l 


be parametrizations of the other edges of the square D*. Using the same argument 
as before, we see that T o c? is a parametrization of the line y = 1—x,0<x <1 


T(-1, -1) = C1, 0) 


T(e4) 
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[the straight line segment joining (0, 1) and (1, 0)]; T o ¢3 is the line y = x + 1, —1 < 
x < 0 joining (0, 1) and (—1,0); and T oc4 is the line y = —x —1,—1 <x <0 
joining (—1, 0) and (0, —1). By this time it seems reasonable to guess that 7 “flips” 
the square D* over and takes it to the square D whose vertices are (1, 0), (0, 1), 
(—1, 0), (0, —1) (Figure 6.1.4). 


T(1, -1) - (0, 1) 


N Figure 6.1.4 The effect of T on the region D*. 


— x 


ra, 1)- 0,0) 


To prove that this is indeed the case, let —1 < o < 1 and let Ly (Figure 6.1.3) be 
a fixed line parametrized by e(t) = (o, t), 1 < t < 1; then T(c(f)) = ((œ + £)/2, 
(a — t)/2) is a parametrization of the line y = —x + o, (a — 1)/2 € x < (œ + 1)/2. 
This line begins, for t = —1, at the point ((w — 1)/2, (1 + a)/2) and ends at the point 
((1 + @)/2, (œ — 1)/2); as is easily checked, these points lie on the lines T o c3 and 
T o ej, respectively. Thus, as œ varies between — 1 and 1, L sweeps out the square D* 
while T (L4) sweeps out the square D determined by the vertices (—1, 0), (0, 1), (1, 0), 
and(0, —1). A 


Images of Maps 


The following theorem is a useful way to describe the image T(D*). 


THEOREM 1 Let A be a 2 x 2 matrix with det A Æ 0 and let T be the linear 
mapping of R? to R? given by T(x) = Ax (matrix multiplication). Then T transforms 
parallelograms into parallelograms and vertices into vertices. Moreover, if T(D*) is 
a parallelogram, D* must be a parallelogram. 


The proof of Theorem 1 is left as Exercise 10 at the end of this section. This 
theorem simplifies the result of Example 2, because we need only find the vertices of 
T(D*) and then connect them by straight lines. 


One-to-One Maps 


Although we cannot visualize the graph of a function 7: IR? — R2, it does help 
to consider how the function deforms subsets. However, simply looking at these 
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deformations does not give us a complete picture of the behavior of T. We may 
characterize T further by using the notion of a one-to-one correspondence. 


DEFINITION A mapping T is one-to-one on D* if for (u, v) and (v, v) € D*, 
T(u, v) = T(u’, v’) implies that u = u’ and v = v". 


This statement means that two different points of D* are not sent into the same 
point of D by T. For example, the function T(x, y) = (x? + y?, y^) is not one-to-one, 
because T(1, —1) = (2, 1) = T(1, 1) and yet (1, —1) z (1, 1). 


DB] Consider the polar coordinate mapping function T: R? — R? 
described in Example 1, defined by T(r, 0) = (r cos 0, r sin@). Show that T is not 
one-to-one if its domain is all of R?. 


SOLUTION  If6; # 65, then T(0, 0) = T(0, 62), and so T cannot be one-to-one. 
This observation implies that if L is the side of the rectangle D* = [0, 1] x [0, 27] 
where 0 < 0 < 2x andr = 0 (Figure 6.1.5), then T maps all of L into a single point, 
the center of the unit disk D. However, if we consider the set S* = (0, 1] x [0, 277), 
then 7: S* — S is one-to-one (see Exercise 1). Evidently, in determining whether a 
function is one-to-one, the domain chosen must be carefully considered. 4 


Figure 6.1.5 The 
polar-coordinate 
transformation T takes 
the line Z to the point 
(0, 0). 


5855") Show that the function 7: R? — R? of Example 2 is one-to-one. 
SOLUTION Suppose T(x, y) = T(x’, y’); then 


x+y x-y = x+y ES 
2 * 2 i 2 °° 2 


and we have 


x4ycxy, 


x—yzcx -y. 


Onto Maps 
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Adding, we have 
2x = 2x'. 


Thus, x = x’ and, similarly, subtracting gives y = y', which shows that T is one-to- 
one (with domain all of R?). Actually, because T is linear and T(x) = Ax, where A 
is a2 x 2 matrix, it would also suffice to show that det 4 4 0 (see Exercise 8). A 


In Examples 1 and 2, we have been determining the image D = T(D*) of a region 
D* under a mapping 7. What will be of interest to us in the next section is, in part, 
the inverse problem: Namely, given D and a one-to-one mapping T of IR? to R?, find 
D* such that T(D*) = D. 

Before we examine this question in more detail, we introduce the notion of “onto.” 


DEFINITION The mapping T is onto D if for every point (x, y) € D there exists 
at least one point (u, v) in the domain of T such that 7(u, v) = (x, y). 


Thus, if T is onto, we can solve the equation 7 (u, v) = (x, y) for (u, v), given 
(x, y) € D. If T is, in addition, one-to-one, this solution is unique. 

For linear mappings T of IR? to IR? (or R” to R”) it turns out that one-to-one and 
onto are equivalent notions (see Exercises 8 and 9). 

If we are given a region D and a mapping T, the determination of a region D* 
such that 7(D*) = D will be possible only when for every (x, y) € D there is a (u, v) 
in the domain of 7 such that T (u, v) = (x, y) (that is, T must be onto D). The next 
example shows that this cannot always be done. 


EXAMPLE J Let T: R? — R? be given by T(u, v) = (u, 0). Let D be the 


square, D = [0, 1] x [0, 1]. Because 7 takes all of R? to one axis, it is impossible to 
find a D* such that T(D*) = D. 4 


Let us revisit Example 2 using these methods. 


EXAMPLE 6] Let T be defined as in Example 2 and let D be the square whose 


vertices are (1, 0), (0, 1), (—1, 0), (0, — 1). Find a D* with T(D*) — D. 


SOLUTION Because T is linear and T(x) = Ax, where A is a 2 x 2 matrix 
satisfying det A 4 0, we know that 7: R? — R? is onto (see Exercises 8 and 9), 
and thus D* can be found. By Theorem 1, D* must be a parallelogram. In order 
to find D*, it suffices to find the four points that are mapped onto vertices of D; 
then, by connecting these points, we will have found D*. For the vertex (1, 0) of 
D, we must solve T(x, y) = (1, 0) = ((x + y)/2, (x — y)/2), so that (x + y)/2 = 1, 
(x — y)/2 = 0. Thus, (x, y) = (1, 1) isa vertex of D*. Solving for the other vertices, 
we find that D* = [—1, 1] x [—1, 1]. This is in agreement with what we found more 
laboriously in Example 2. A 


U 
-l 
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DSSS 5 Sei) Let D be the region in the first quadrant lying between the arcs 
of the circles x? + y? = a?, x? + y? = b?, 0 <a < b (see Figure 6.1.6). These cir- 
cles have equations r — a and r — b in polar coordinates. Let 7 be the polar- 
coordinate transformation given by T(r, 0) = (r cos6, r sin 8) = (x, y). Find D* such 
that T(D*) = D. 


Figure 6.1.6 We seek a region D* in the 0r plane whose image 
under the polar-coordinate mapping is D. 


(a,0) (5,0) 


SOLUTION Inthe region D, a? < x? + y? < b?; and because r? = x? + y?, we 
see that a <r < b. Clearly, for this region 0 varies between 0 < 0 < 7/2. Thus, if 
D* = [a, b] x [0, 7/2], we have T(D*) = D and T is one-to-one. A 


REMARK The inverse function theorem discussed in Section 3.5 is relevant to the 
material here. It states that if the determinant of DT (uo, vo) [which is the matrix of 
partial derivatives of T evaluated at (uo, vo)] is not zero, then for (u, v) near (uo, vo) 
and (x, y) near (xo, yo) = T (uo, vo), the equation T(u, v) = (x, y) can be uniquely 
solved for (u, v) as functions of (x, y). In particular, by uniqueness, 7 is one-to-one 
near (uo, Vo); also, T is onto a neighborhood of (xo, yo), because 7 (u, v) = (x, y) is 
solvable for (u, v) if (x, y) is near (xo, yo). 

However, even if T is one-to-one near every point, and also onto, 7 need not be 
globally one-to-one. Thus, one must exercise caution (see Exercise 12). 


Surprisingly, if D* and D are elementary regions and T: D* — D has the prop- 
erty that the determinant of DT (u, v) is not zero for any (u, v) in D* and if T maps 
the boundary of D* in a one-to-one and onto manner to the boundary of D, then T is 
one-to-one and onto from D* to D. (This proof is beyond the scope of this text.) 

In summary, we have: 


One-to-One and Onto Mappings A mapping 7: D* — D is one-to-one 
when it maps distinct points to distinct points. It is onto when the image of 
D* under 7 is all of D. 


A linear transformation of R” to R” given by multiplication by a matrix A 
is one-to-one and onto when and only when det A Æ 0. 
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EXERCISES 
1. Let S* = (0, 1] x [0, 277) and define T(r, 0) = (r cos@, r sin@). Determine the image 
set S. Show that T is one-to-one on S*. 


2. Define 


n iecit ia 
Toy") = ( y *) 


42/742 
Show that T rotates the unit square, D* — [0, 1] x [0, 1]. 


3. Let D* = [0, 1] x [0, 1] and define 7 on D* by T(u, v) = (—u? + 4u, v). Find the 
image D. Is T one-to-one? 


4. Let D* be the parallelogram bounded by the lines y = 3x — 4, y 23x, y = ix, and 
y= i +4). Let D = [0, 1] x [0, 1]. Find a 7 such that D is the image of D* under T. 


5. Let D* = [0, 1] x [0, 1] and define T on D* by T(x*, y*) = (x*y*, x*). Determine the 
image set D. Is T one-to-one? If not, can we eliminate some subset of D* so that on the 
remainder T is one-to-one? 


6. Let D* be the parallelogram with vertices at (— 1, 3), (0, 0), (2, — 1), and (1, 2), and D be 
the rectangle D — [0, 1] x [0, 1]. Find a 7 such that D is the image set of D* under T. 


7. Let T: R? — R? be the spherical coordinate mapping defined by (p, $, 0) > (x, y, Z), 
where 
x = psingcosé, y =psing sing, z= pcos. 


Let D* be the set of points (p, $, 0) such that œ € [0, 7], 0 € [0, 27], p € [0, 1]. Find 
D = T(D"*). Is T one-to-one? If not, can we eliminate some subset of D* so that, on the 
remainder, 7 will be one-to-one? 
In Exercises 8 and 9, let T(x) = Ax, where A is a2 x 2 matrix. 

8. Show that 7 is one-to-one if and only if the determinant of A is not zero. 

9. Show that det A ¥ 0 if and only if T is onto. 
10. Suppose 7: R? — R? is linear and is given by T(x) = Ax, where A isa 2 x 2 matrix. 
Show that if det A # 0, then T takes parallelograms onto parallelograms. [HINT: The general 
parallelogram in IR? can be described by the set of points q = p + Av + uw for A, u € (0, 1) 


where p, v, w are vectors in R? with v not a scalar multiple of w.] 


11. Suppose 7: R? — R? is as in Exercise 10 and that T(P*) = P is a parallelogram. Show 
that P* is a parallelogram. 


12. Consider the map T: D — D, where D is the unit disk in the plane, given by 


T(r cos0, r sin@) = cos 26, r° sin 20). 
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Using complex notation, z = x + iy, the map T can be written as T(z) = z?. Show that the 
Jacobian determinant of T vanishes only at the origin. Thus, away from the origin, 7 is 
locally one-to-one. However, show that 7 is not globally one-to-one on R? minus the origin. 


6.2 The Change of Variables Theorem 


Given two regions D and D* in R?, a differentiable map T on D* with image D, that 
is, T(D*) — D, and any real-valued integrable function f: D — IR, we would like 
to express f, p f. y) d A as an integral over D* of the composite function f o T. In 
this section we shall see how to do this. 

Assume that D* is a region in the wv plane and that D is a region in the xy plane. 
The map T is given by two coordinate functions: 


T(u, v) = (x(u, v), y(u, v)) for (u, v) € D*. 


At first, one might conjecture that 


J| eaa f " J&t, v), yu, v) du d, a) 


where f o T(u, v) = f(x(u, v), y(u, v)) is the composite function defined on D*. 
However, if we consider the function f: D — IR? where f(x, y) — 1, then equation 


(1) would imply 
A(D) = If, dx dy - f. du dv = A(D^). (2) 


But equation (2) will hold for only a few special cases and not for a general map T. 
For example, define T by T (u, v) = (~u? + 4u, v). Restrict T to the unit square; that 
is, to the region D* = [0, 1] x [0, 1] in the wv plane (see Figure 6.2.1). Then, as in 
Exercise 3, Section 6.1, T takes D* onto D = [0, 3] x [0, 1]. Clearly, A(D) # A(D?), 
and so formula (2) is not valid. 


Figure 6.2.1 The map T: (u, v) > (~u? + 4u, v) takes the square D* onto the 
rectangle D. 
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Jacobian Determinants 


To rectify the incorrect formula (1), we need a measure of how a transformation 
T: R? — R? distorts the area of a region. This is given by the Jacobian determinant, 
which is defined as follows. 


DEFINITION: Jacobian Determinant Let T: D* c R? —> R? be a C! 
transformation given by x = x(u, v) and y = y(u, v). The Jacobian determi- 
nant of T, written d(x, y)/d(u, v), is the determinant of the derivative matrix 
DT(u, v) of T: 


Ox ax 
9(.») |8u ðv 
d(u, v) dy Qy| 

ðu ðv 


2259580) The function from R? to IR? that transforms polar coordinates into 
Cartesian coordinates is given by 


x =rcosé, y=rsind 
and its Jacobian determinant is 


a(x, y) _ 
a(r,@) -— 


cos0 —rsin@ 
sinô  rcosO 


= r(cos? 0 -- si 8) =r. A 


Under suitable restrictions on the function 7, we will argue below that the 
area of D — T(D*) is obtained by integrating the absolute value of the Jacobian 
a(x, y)/8(u, v) over D*; that is, we have the equation 


A(D) = If, dx dy = 


To illustrate: From Example 1 in Section 6.1, take T: D* — D, where D = 
T(D*) is the set of (x, y) with x? + y? < 1 and D* = [0, 1] x [0, 27], and T(r, 0) = 
(r cos, r sin 0). By formula (3), 


sn» ff 


36. »)| , 
+ LO(u, ij^ 


dv. (3) 


a(x, y) = 
306) 4? ff e (4) 
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(here r and 0 play the role of u and v). From the preceding computation it follows 


that 
2x pl 2x r,21]! 1 27 
If rdrao = | n [ | ] dé J) d0 =x 
: o Jo o L2]o 2 Jo 


is the area of the unit disk D, confirming formula (3) in this case. In fact, we may 
recall from first-year calculus that equation (4) is the correct formula for the area of 
a region in polar coordinates. 

It is not so easy to rigorously prove assertion (3). However, looked at in the 
proper way, it becomes quite plausible. Recall that A(D) = ff, p dx dy was obtained 
by dividing up D into little rectangles, summing their areas, and then taking the limit 
of this sum as the size of the subrectangles tended to zero. The problem is that 7 may 
map rectangles into regions whose area is not easy to compute. The solution is to 
approximate these images by simpler regions whose area we can compute. A useful 
tool for doing this is the derivative of 7, which we know (from Chapter 2) gives the 
best linear approximation to T. 

Consider a small rectangle D* in the wv plane as shown in Figure 6.2.2. Let 7" 
denote the derivative of T evaluated at (uo, vo), so T’ is a 2 x 2 matrix. From our 
work in Chapter 2, we know that a good approximation to T(u, v) is given by 


,( Au 
T(uo, vo) + T Es i 


T(uo, vo) + T (D*) 


ox. ay; 
Avi tjAv— 
Ls MEE 
2 


Gt Yo) 


= T (uo, vo) \ Xi vj 
i HL 


(uo vo) — Aui 


Figure 6.2.2 The effect of the transformation T on a small rectangle D*. 


where Au = u — ug and Av = v — vo. This mapping 7” takes D* into a parallelogram 
with vertex at T (uo, vo) and with adjacent sides given by the vectors 


ax Ox ax 
T'(Aui) = 5" an [o= s = AuT, 


ðu ðv ðu 


o 
x] 
Eel 
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and 
dx ax Ox 
mann | Iu Ov el. av | _ 
T'(Avj) = dp dv [4] = Av "s AT,, 
ðu ðv ðv 
where 
ax ay Ox oy 
T, = —i-t —j d T, = —i+ — 
“au i au! S ° a av! 


are evaluated at (uo, vo). 
Recall from Section 1.3 that the area ofthe parallelogram with sides equal to the 
vectors ai + bj and ci + dj is equal to the absolute value of the determinant 


a c 
b d 


a b 
c d 


Thus, the area of T(D*) is approximately equal to the absolute value of 


Ox Ox ax ax 
oe 3e" [au 3 ax, y) 
ae v f=] ol Au Av = Au Av 
ay ay dy ay a(u, v) 
Au Av 
ðu ðv ðu ðv 


evaluated at (uo, vo). 

This fact and a partitioning argument should make formula (3) plausible. In- 
deed, if we partition D* into small rectangles with sides of length Au and Av, the 
images of these rectangles are approximated by parallelograms with sides T, Au 
and T, Av, and hence with area |d(x, y)/d(u, v)| Au Av. Thus, the area of D* 
is approximately 3^ Au Av, where the sum is taken over all the rectangles R in- 
side D* (see Figure 6.2.3). Hence, the area of T(D") is approximately the sum 


v 


Figure6.2.3 The area of the little rectangle R is Au Av. The area of T(R) is 
approximately |8(x, y)/8(u, v)|AuAv. 
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318G, y)/8(u, v)| Au Av. In the limit, this sum becomes 


If. a(x, y) 


d(u, v) 

Let us give another informal argument for the special case (4) of formula (3), 
that is, the case of polar coordinates. Consider a region D in the xy plane and a 
grid corresponding to a partition of the r and @ variables (Figure 6.2.4). The area 
of the shaded region shown is approximately (Ar)(r;, A0), because the arc length 
of a segment of a circle of radius r subtending an angle ¢ is r$. The total area is 
then the limit of ) ^ rj, Ar A6; that is, ffp. r dr dO. The key idea is thus that the jkth 
“polar rectangle" in the grid has area approximately equal to r;, Ar A0. (For n large, 
the jkth polar rectangle will look like a rectangle with sides of lengths r;, A0 and 
Ar). This should provide some insight into why we say the “area element dx dy” is 
transformed into the "area element rdr d0." 


du dv. 


Figure 6.2.4 D* is mapped to D under the polar-coordinate mapping T. 


DGAIA Let the elementary region D in the xy plane be bounded by the 
graph of a polar equation r = f(0), where 0) < 0 < 0; and (0) > 0 (see Fig- 
ure 6.2.5). In the r0 plane we consider the r-simple region D* where 6) < 0 < 6, 


Figure 6.2.5 The effect on the region D* of the polar-coordinate mapping. 
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and 0 <r < f(@). Under the transformation x = r cos0, y =r sin 0, the region D* 
is carried onto the region D. Use equation (4) to calculate the area of D. 


SOLUTION 
a(x, y) 


ae ff, ^ JA. 
= ff rmm = f [S rar] dé 


| r423/0) 6, 2 
"n [z] go [ LOL y 
& L2 


0 @ 2 


dr d6 


This formula for A(D) should be familiar from one-variable calculus. A 


Change of Variables Formula 


Before stating the two-variable change of variables formula, which is the culmination 
of this discussion, let us recall the corresponding theorem from one-variable calculus 
that goes under the name the method of substitution: 


b dx x(b) 
Í Sa) du = [ NI 6) 


where f is continuous and u > x(u) is continuously differentiable on [a, b]. 


PROOF Let F bean antiderivative of f; that is, F' = f, whose existence is guar- 
anteed by the fundamental theorem of calculus. The right-hand side of equation (5) 
becomes 


x(b) 
f(x) dx = F(x(b)) — F(x(a)). 


x(a) 


To evaluate the left-hand side of equation (5), let G(u) = F(x(u)). By the chain rule, 
G'(u) = F'(x(u))x'(u) = f(x(u))x'(u). Hence, again by the fundamental theorem, 


b b 
f fowr'wau= f G'(u)du = G(b) — G(a) = F(x(b)) — F(x(a)), 


asrequired. m 


Suppose now that we have a C! function u ++ x(u) that is one-to-one on [a, b]. 
Thus, we must have either dx/ du > 0 on [a, b] or dx/du < 0 on [a, b].! Let I* 
denote the interval [a, b], and let / denote the closed interval with endpoints x(a) 


‘If dx /du is positive and then negative, the function x = x(u) rises and then falls, and thus is not one-to-one; a similar 
statement applies if dx / du is negative and then positive. 
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and x(b). (Thus, 7 = [x(a), x(b)] if u + x(u) is increasing and J = [x(5), x(a)] if 
u +> x(u) is decreasing.) With these conventions we can rewrite formula (5) as 


[soon 


This formula generalizes to double integrals, as was already given informally in for- 
mula (3): /* becomes D*, J becomes D, and |dx /du| is replaced by |8(x, y)/8(u, v)]. 
Let us state the result formally (the technical proof is omitted). 


dx 
EA du= [ reos. 


THEOREM 2: Change of Variables: Double Integrals Let D and 
D* be elementary regions in the plane and let 7: D* — D beofclass C'; suppose 
that T is one-to-one on D*. Furthermore, suppose that D = T(D"). Then for 


any integrable function f: D — IR, we have 


a(x, 
1 T, Jæ ydedy = ] ji fex, v), yu, v) x. 2| dudv. (6) 


,v) 


One of the purposes of the change of variables theorem is to supply a method 
by which some double integrals can be simplified. One might encounter an integral 
Ff, p f dA for which either the integrand f or the region D is complicated and for 
which direct evaluation is difficult. Therefore, a mapping 7' is chosen so that the 
integral is easier to evaluate with the new integrand f o T and with the new region 
D* [defined by 7(D*) = D]. Unfortunately, the problem may actually become more 
complicated if T is not selected carefully. 


[CUIUS] Let P be the parallelogram bounded by y = 2x, y = 2x —2, 
y =x, and y =x + l (see Figure 6.2.6). Evaluate ff^, xy dx dy by making the 


Figure 6.2.6 The 
effect of T(u, v) = 
(u — v, 2u — v) on 
the rectangle P*. 
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change of variables 
x=u-v, y=2u-v, 
that is, T(u, v) = (u — v, 2u — v). 


SOLUTION The transformation 7 has nonzero determinant and so is one-to- 
one (see Exercise 8, Section 6.1). It is designed so that it takes the rectangle P* 
bounded by v = 0, v = —2, u = 0, u = 1 onto P. The use of T simplifies the region 
of integration from P to P*. Moreover, 


36,3) |. [1 -1]].. 
= | det 2 1 =f 


au, v) 
Therefore, by the change of variables formula, 


0 pl 
If xydxdy= ff (u — v)Qu — v)du dv ' f G9 -3vu + auae 
P p -2 Jo 
0 2 1 0 
24, 3v 2 il 2 4 2 
= = ! dv= z— 4,04 
[i E 2 vul v 3 2" v“ | dv 


Integrals in Polar Coordinates 


Suppose we consider the rectangle D* defined by 0 < 0 < 2z,0 <r <a in the 
r0 plane. The transformation T given by T(r, 0) = (r cos0, r sin@) takes D* onto 
the disk D with equation x? -- y? « a? in the xy plane. This transformation repre- 
sents the change from Cartesian coordinates to polar coordinates. However, T does 
not satisfy the requirements of the change of variables theorem, because it is not 
one-to-one on D*: In particular, T sends all points with r = 0 to (0, 0) (see Figure 
6.2.7 and Example 3 of Section 6.1). Nevertheless, the change of variables theorem is 
valid in this case. Basically, the reason for this is that the set of points where T is not 
one-to-one lies on an edge of D*, which is the graph of a smooth curve and therefore, 
for the purpose of integration, can be neglected. In summary, the formula 


Change of Variables—Polar Coordinates 


i f(x, y)dx dy = If. S(rcosé,r sin0)r dr d0 (7) 
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Figure 6.2.7 The image 
of the rectangle D* under 
the polar-coordinate 
transformation is the 
disk D. 


is valid when 7 sends D* onto D in a one-to-one fashion except possibly for points 
on the boundary of D*. 


DONS OS] Evaluate ff, log (x? + y?) dx dy, where D is the region in the first 
quadrant lying between the arcs of the circles x? + y? = a? and x? + y? = b?, where 
0 <a « b (Figure 6.2.8). 


Figure 6.2.8 The 
polar-coordinate mapping 
takes a rectangle D* onto 
part of an annulus D. 


(40) — (50) 


SOLUTION These circles have the simple equations r — a and r — b in polar 
coordinates. Moreover, r? = x? + y? appears in the integrand. Thus, a change to 
polar coordinates will simplify both the integrand and the region of integration. From 
Example 7, Section 6.1, the polar-coordinate transformation 


x =rcosé, y —rsing 
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sends the rectangle D* given by a <r < b,0 € 0 € z/2 onto the region D. This 
transformation is one-to-one on D* and so, by formula (7), we have 


Í log (x? + y?) dx dy = ‘a I rlogr? d8 dr 
D 
b 
= zl rlogr? &-5] 2r logr dr. 


Applying integration by parts, or using the formula 


2 2 

x x 

l ze dE. 
IE og x dx 2 og x 4 


from the table of integrals at the back of the book, we obtain the result 


x [^ Fid 1 
=f 2r logr dr = — | b?’ logb — a° loga — ~(b? — a°) |. à 
24. 2 2 


DOWIE Ss) The Gaussian Integral One of the most beautiful applications 
of the change of variables formula, polar coordinates, and the reduction to iterated 
integrals is their application to the following formula, known as the Gaussian integral: 


ee 2 
f e™ dx 2 Jm. 


oo 


Not only is this formula very attractive in its own right, but it is also useful in areas 
such as statistics. It also illustrates the unity of the transcendental numbers e and zr 
nearly as well as does the classic formula et” = —1. 

To carry out the integration of the Gaussian integral,” we first evaluate the double 


integral 
I e €) dx dy, 
D, 


?The method that follows is admittedly not straightforward but requires a trick. The trick is to start with the desired formula 
and square both sides. You will then observe that the left-hand side resembles an iterated integral. There are several 
other ways to evaluate the Gaussian integral, but all of them require some nonobvious method. For the use of complex 
variables to evaluate it, see, for example, J. Marsden and M. Hoffman, Basic Complex Analysis, 3rd ed., W. H. Freeman, 
New York, 1998. 
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where D, is the disk x? + y? < a?. Because r? = x? + y*, and dx dy =r dr d0, the 
change of variables formula gives 


-— Qn pa, 2r i 
Il e e "D dx dy = f [ e*rdrd0 = f (6 ) 
Da o Jo 0 2 


2n 


---]| (*-n0ae-zü-e*. 


a 


do 
0 


If we let a — oo in this expression, we give meaning to the improper integral 


and get 
I e dy dy — z. 
R? 


Assuming (as shown in the Internet supplement) that we can also evaluate this im- 
proper integral as the limit of the integrals over the rectangles R; = [—a, a] x [—a, a] 


asa — oo, we get 
lim i e C dx dy — m. 
arco J Jp, 


By reduction to iterated integrals, we can write this as 


a a a 2 
lim [/ e ax f e ay = [im f ex as Lm. 
aoo | Ja d aoo] a 
oo " 2 
pi e ax -—. 
—oo 


Thus, taking square roots, we arrive at the desired result. 
Here is a variant of the Gaussian integral. Evaluate 


v9 2 
f e dx. 
—oo 


To do this, use the change of variables formula y = 4/2x to reduce the problem to the 
Gaussian integral just computed: 


That is, 


oo a + VJ2a d 
i e? dx = lim | e? dx = lim gre 
-— a>% J 4-00 JJa A2 
A 


== [ w= ZA T 
“Mle YS Ve 
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-l 


Change of Variables Formula for Triple Integrals 


To state this formula, we first define the Jacobian of a transformation from IR? to 
TR?— it is a simple extension of the two-variable case. 


DEFINITION | Let T: W c R? > R? bea C! function defined by the equations 
x = x(u, v, w), y = y(u, v, w), z = z(u, v, w). Then the Jacobian of T, which is 
denoted d(x, y, z)/8(u, v, w), is the determinant 

ax dx ax 

du ðv ðw 

ay dy dy 

ðu dv dw] 

dz Oz dz 

du ðv dw 


The absolute value of this determinant is equal to the volume of the parallelepiped 
determined by the three vectors 


T mI TE: 
um u +2 
Ox oy üz 
T, = i+ j+ Lk, 
(79v tatg 


Just as in the two-variable case, the Jacobian measures how the transformation T 
distorts the volume of its domain. Hence, for volume (triple) integrals, the change of 
variables formula takes the following form: 


Change of Variables Formula: Triple Integrals 


I, fe, y, z) dx dy dz 


a s 
- fff f(x(u, v, w), y(u, v, w), z(u, v, up| rd 


(8) 
dudvdw, 


where W* is an elementary region in uvw space corresponding to W in xyz space, 
under a mapping T: (u, v, w) e (x(u, v, w), y(u, v, w), z(u, v, w)), provided T 
is of class C! and is one-to-one, except possibly on a set that is the union of graphs 
of functions of two variables. 
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Cylindrical Coordinates 


Let us apply formula (8) to cylindrical and then to spherical coordinates. First, we 
compute the Jacobian for the map defining the change to cylindrical coordinates. 


Because 
x =r cos, y —rsin6, Z-—£, 
we have 
cos@ -—rsinO0 0 
LL sing rcosó O|—r. 
a(r, 0, z) 0 0 1 


Thus, we obtain the formula 


Change of Variables—Cylindrical Coordinates 


ll fG. y,z)dxdydz = I f(r cos0,rsin0,z)rdrd0dz. (9) 
w we 


Spherical Coordinates 


Next we consider the spherical coordinate system. Recall that it is given by 
x = psingcosé, y = psinósin6, z = p cos. 
Therefore, we have 
a(x. y.2) sin$cosÓ0 —psingsin@ cos cos 
TWO m sinósinÜ psing@cos@  pcosósin6|. 
(p. 0, 9) coso 0 —psing 
Expanding along the last row, we get 


a(x, yz) _ ó —psin$sinÓ pcospcosé 
ad) S psingcos@ pcosdsind 


sin$cosÓ0 —psingsin@ 


—p sing singsin@ — psin$cosó 


= -—p cos? ¢ sing sin? 0 — o? cos? $ sing cos? 6 
=p? sin? $ cos? 0 — pl sin? $ sin? 0 


= —p? cos? p sing — p? sin? $ = —p? sing. 
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Thus, we arrive at the formula: 


Change of Variables—Spherical Coordinates 


ll p Pul) indui 


(10) 
= Ii f(p sin $ cos, p sing sin, p cos $)p? sing dp dé d$. 
we 


To prove formula (10), one must show that the transformation S on the set W* is 
one-to-one except on a set that is the union of finitely many graphs of continuous 
functions. We shall leave this verification as Exercise 34. 


AMPLE 6 Evaluate 


ni exp? +y?  z2? dV, 
Ww 


where W is the unit ball in IR?. 


SOLUTION First note that we cannot easily integrate this function using iterated 
integrals (try it!). Hence (employing the strategy in the quote that opened this chapter), 
let us try a change of variables. The transformation into spherical coordinates seems 
appropriate, because then the entire quantity x? + y? + z? can be replaced by one 
variable, namely, p?. If W* is the region such that 


O<psl, O<@<2n7, O<¢<z, 


we may apply formula (10) and write 


Hii exp (x? +y? +2)? qy = "i pre” sing dp dé d$. 
w we 


This integral equals the iterated integral 
1 pr pan j 1 px à 
Í [ [ e” p sing dô dp dp = 2x | f e” p? sing d$ dp 
0 Jo JO 0 J0 
1 
= -2x | pre” [cos 9]; do 
0 
Lata 4 fU paa 
-«[ e? p do- in| e? (30^) dp 
0 3 Jo 


4 t- 
= EU =37(e-1). 4 
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I 1302897] Let W be the ball of radius R and center (0, 0, 0) in IR?. Find the 
volume of W. 


SOLUTION The volume of W is fff, w dx dy dz. This integral may be evaluated 
by reducing it to iterated integrals or by regarding W as a volume of revolution, but 
let us evaluate it here by using spherical coordinates. We get 


2r 2x 
I dxrdydz= f" | 3 p sing dp dd dd = — 3 =f" sind d0 dp 


3 
ERE sing dd UN (-[cos(x) — cos(0)]} = MEE š 
3 Jo 3 3 


which is the standard formula for the volume of a solid sphere. A 


EXERCISES 
1. Let D be the unit disk: x? + y? < 1. Evaluate 


If exp (x? + y?) dx dy 
D 


by making a change of variables to polar coordinates. 


2. Let D be the region 0 € y < x and 0 < x < 1. Evaluate 


[[ eoo 


by making the change of variables x = u + v, y = u — v. Check your answer by evaluating 
the integral directly by using an iterated integral. 


3. Let T(u, v) = (x(u, v), y(u, v)) be the mapping defined by T(u, v) = (4u, 2u + 3v). Let 
D* be the rectangle [0, 1] x [1, 2]. Find D — T(D*) and evaluate 


(a) [ie (b) [[ e» 


by making a change of variables to evaluate them as integrals over D*. 
4. Repeat Exercise 3 for T (u, v) = (u, v(1 + u)). 
5. Evaluate 
f "i dx dy 
DA -cx-c-2y 
where D = [0, 1] x [0, 1], by setting T(u, v) = (u, v/2) and evaluating an integral over D*, 
where 7(D*) = D 


6. Define T(u, v) = (u? — v?, 2uv). Let D* be the set of (u, v) with u? + v? < 1, u > 0, 
v > 0. Find 7(D*) = D. Evaluate ff, dx dy. 
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7. Let T(u, v) be as in Exercise 6. By making a change of variables, “formally” evaluate the 
*improper" integral 


f T dx dy 
D yX? + y 

[Note: This integral (and the one in the next exercise) is improper, because the integrand 
1/,\/x? + y? is neither continuous nor bounded on the domain of integration. (The theory of 
improper integrals is discussed in Section 6.4.)] 

1 
8. Calculate I moe: dy dx, where R is the region bounded by x = 0, y 20, x +y — 1, 
RXTY 

x + y = 4, by using the mapping T(u, v) = (u — uv, uv). 


9. Evaluate Í | (x? + y^? dx dy where D is the disk x? + y? < 4. 
D 


10. Let D* be a v-simple region in the uv plane bounded by v = g(u) and v = h(u) < g(u) 
fora <u € b. Let T: R? — R? be the transformation given by x = u and y = y(u, v), 
where y/ is of class C! and dy//dv is never zero. Assume that T(D*) = D is a y-simple 
region; show that if f: D — R is continuous, then 


n - ff. fu, Wu, v) 


11. Use double integrals to find the area inside the curve r = 1 + sin@. 


By 


dv du dv. 


12. (a) Express f k a xy dy dx as an integral over the triangle D*, which is the set of (u, v) 
where 0 € u € 1,0 < v < u. (Hint: Find a one-to-one mapping T of D* onto the given 
region of integration.) 

(b) Evaluate this integral directly and as an integral over D*. 
13. Integrate ze" ^^" over the cylinder x? + y? < 4,2 < z <3. 


14. Let D be the unit disk. Express II (1 +x? + ?y*? dx dy as an integral over 
[0, 1] x [0, 277] and evaluate. g 


15. Using polar coordinates, find the area bounded by the lemniscate (x? + y? = 
2a? (x? — y?) 


16. Redo Exercise 11 of Section 5.3 using a change of variables and compare the effort 
involved in each method. 


17. Calculate n + y} e dx dy where R is the region bounded by x + y = 1, 
R 
x+y=4,x-y=—l,andx—y=l1. 


18. Let T: IR? — R? be defined by 


T(u, v, w) = (u cos v cos w, u sin v cos w, u sin w). 


The Change of Variables Formula and Applications of Integration 


(a) Show that T is onto the unit sphere; that is, every (x, y, z) with x^ + y? cz = 1 
can be written as (x, y, z) = T(u, v, w) for some (u, v, w). 
(b) Show that T is not one-to-one. 


19. Integrate x? + y? + z? over the cylinder x? + y? < 2, —2 < z <3. 
20. Evaluate fj" e? dx. 
21. Let B be the unit ball. Evaluate 
| | T dx dy dz 
B2 xb y +27 
by making the appropriate change of variables. 


22. Evaluate io + Y)y] dx dy where A is determined by the conditions x? + y? < 1 
A 
and x 4- y 7 1. 


dx dy d. 
23. Evaluate II, @ a ae " zy , where W is the solid bounded by the two spheres 


x? +y? +27 =a? and x? + y? +2? = B5, where 0 < b <a. 


24. Evaluate i] fi x? dx dy where D is determined by the two conditions 0 < x « y and 
x? y! xl. 


25. Integrate /x? + y? + z? e- 6 2^ over the region described in Exercise 23. 
26. Evaluate the following by using cylindrical coordinates. 


(a) I z dx dy dz where B is the region within the cylinder x? + y? = 1 above the 
B 


xy plane and below the cone z = (x? + y?)!2. 
(b) Í n | (x? + y? zy? dx dy dz where W is the region determined by the 
wW 


conditions i <z<landx?4+y?+2? «1. 


27. Evaluate (x + y) dx dy where B is the rectangle in the xy plane with vertices at 
B 
(0, 1), (1, 0), (3, 4), and (4, 3). 


28. a ff (x + y) dx dy where D is the square with vertices at (0, 0), (1, 2), (3, 1), 
and (2, — 
29. Let E be the ellipsoid (x?/a?) + (y?/b?) + (z?/c?) < 1, where a, b, and c are positive. 
(a) Find the volume of E. 
(b) Evaluate I [G2/a?) + (9/b?) + (2?/c?)] dx dy dz. (HINT: Change variables 
and then use spherical coordinates.) 
30. Using spherical coordinates, compute the integral of (p, $, 0) = 1/p over the region in 
the first octant of R?, which is bounded by the cones $ = 7/4, $ = arctan 2 and the sphere 


p = 46. 
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31. The mapping T(u, v) = (u? — v?, 2uv) transforms the rectangle 1 < u < 2,1 <v <3 
of the uv plane into a region R of the xy plane. 


(a) Show that T is one-to-one. 
(b) Find the area of R using the change of variables formula. 


32. Let R denote the region inside x? + y? = 1, but outside x? + y? = 2y with 
x>0,y>0. 


(a) Sketch this region. 
(b) Letu = x? + y?, v = x? + y? — 2y. Sketch the region D in the uv plane, which 
corresponds to R under this change of coordinates. 


(c) Compute I xe” dx dy using this change of coordinates. 
R 


33. Let D be the region bounded by x*/? + y3/? = a*/?, for x > 0, y > 0, and the coordinate 
axes x = 0, y = 0. Express If J (x, y) dx dy as an integral over the triangle D*, which is 
D 
the set of points 0 < u < a, 0 < v < a — u. (Do not attempt to evaluate.) 
34. Show that S(p, 0, $) = (p sin ġ cos6, p sing sin, p cos $), the spherical change-of- 


coordinate mapping, is one-to-one except on a set that is a union of finitely many graphs of 
continuous functions. 


6.3 Applications 


Averages 


In this section, we shall discuss average values, centers of mass, moments of inertia, 
and the gravitational potential as applications. 


If x, ..., x, are n numbers, their average is defined by 
xit tx 1S 
[xilay = ————— =- 2x. 
n n 


Notice that if all the x; happen to have a common value c, then their average, of 
course, also equals c. 
This concept leads one to define the average values of functions as follows. 


Average Values The average value of a function of one variable on the interval 
[a, b] is defined by 


b 
d. 
ppt ie, 


Likewise, for functions of two variables, the ratio of the integral to the area of D, 


_ Jo FŒ y)dx dy 


cel m () 
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is called the average value of f over D. Similarly, the average value of a function 
f ona region W in three space is defined by 


Mw S (x, y. z) dx dy dz 


[fav = 


Why dx dy dz 


Again, notice that the denominator is chosen so that if f is a constant, say c, then 


[flav =e. 


DANIE Find the average value of f(x, y) = x sin’(xy) on the region D = 
[0, x] x [0, 7]. 


SOLUTION First, we compute 


= Ls m 9 
f| re e = f [ x sin (xy) dx dy 
= F TA 1— cos(2xy) a] gis 
o Lo 2 
*Ty  sin2xy)] |" 
f [5 UNUM IE dx 
[a [2 sin(2zx) " mx? | mm 
~ Jo L 2 4 I a 8r 


y=0 
zx? cos(27?) — 1 
s! cord- 

8r 


n 


0 
Thus, the average value of f, by formula (1), is 


m?/4 +[cos(2x?) — 1]/8% m  cos(2x?)— 1 
2 t 
x 4 &? 


~ 0.7839. a 


ESAE The temperature at points in the cube W = [—1, 1] x [-1, 1] x 


[-1, 1] is proportional to the square of the distance from the origin. 


(a) What is the average temperature? 


(b) At which points of the cube is the temperature equal to the average temperature? 


SOLUTION (a) Let c be the constant of proportionality so T = c(x? + y? + 2?) 
and the average temperature is [T]ay = H Í (i) w T dx dy dz,because the volume of the 


6.3 Applications 395 


cube is 8. Thus, 


1 1 1 
[T lav = al il li (x2 + y! +22) dx dy dz. 
-1J-1J-4 


The triple integral is the sum of the integrals of x?, y?, and z?. Because x, y, and z 
enter symmetrically into the description of the cube, the three integrals will be equal, 
so that 


S 1 jj 
Ed f f 2 de dyde =£ [ 2 (/ f axd) dz. 
8 Jai Jda 8 Ja EE 


The inner integral is equal to the area of the square [—1, 1] x [—1, 1]. The area of 
that square is 4, and so 


1 3 1 
[Plum d adag 2 (2 =e 
8 J4 ONS La 


(b) The temperature is equal to the average temperature at all points satisfying c(x? + 
y? +2?) = c, that is, which lie on the sphere x? + y? + z? = 1, which is inscribed in 
the cube W. A 


Centers of Mass 


If masses mj, ..., m, are placed at points x,,...,x, on the x axis, their center of 
mass is defined to be 
mix, 
ga Era ái 
Limi 


This definition arises from the following observation: If one is balancing masses on a 
lever (Figure 6.3.1), the balance point x occurs where the total moment (mass times 
distance from the balance point) is zero, that is, where J` m;(x; — X) = 0. A physical 
principle, going back first to Archimedes and then in this generality to Newton, states 
that this condition means there is no tendency for the lever to rotate. 


EI 


7 Figure 6.3.1 The lever is 
| " = | | balanced if X(x; — x)m; = 0. 


m3 m, m 


Fora continuous mass density 5(x) along the lever (measured in, say, grams/cm), 
the analog of formula (2) is 


f xà(x) dx 
f 96)dx ' 


x= 


(3) 
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For two-dimensional plates, this generalizes to: 


The Center of Mass of Two-Dimensional Plates 


Wp xx, y) dx dy LE Mp yŒ, y) dx dy 


x= 


henad "Tae ydp O 


where again ó(x, y) is the mass density (see Figure 6.3.2). 


Center of mass Plate 


Figure 6.3.2 The plate balances when supported at its center of mass. 


GTI Ss) Find the center of mass of the rectangle [0, 1] x [0, 1] if the mass 
density is e*^. 


SOLUTION First we compute the total mass: 


0o 1 [ 
I e dxdy = [ h e" dx dy =f (elo) dy af (e^? — e")dy 
D o Jo 0 0 


(el? ey ue —e-(e- 1) =e? -2e +1. 


The numerator in formula (4) for x is 


lel 1 1 
[ f xe" dx dy = n (xe? — e+) ady = [ [e — e!+” — (0e" — e" dy 
0 J0 0 0 


1 
=f e'dy-e'|ly-e 1, 
0 


so that 


2 e—] e-l 
LT 


1 
£ 0.582. 
e—2e+1 (e—-1y -1 


The roles of x and y may be interchanged in all these calculations, so that y — 
1/(e — 1) © 0.582 as well. A 
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For a region W in space with mass density 5(x, y, z), we know that 


volume — III, dx dy dz, (5) 
mass — Il d(x, y, z) dx dy dz. (6) 


Ifone denotes the coordinates ofthe center of mass by (x, y, z), then the generalization 
of the formulas in the preceding box are as follows. 


Coordinates for the Center of Mass of Three-Dimensional Regions 


2 Why x8, y, z) dx dy dz 
X > 


mass 
d(x, y,z)dx dy dz 
5 = iyi ds dy dz > 
mass 
E Why 25(x, y, z) dx dy dz 
m mass E 


I: W7828972 The cube [1,2] x [1,2] x [1,2] has mass density given by 
d(x, y, z) = (1 + x)e y. Find the total mass of the box. 


SOLUTION The mass of the box is, by formula (6), 


2 p2 p2 à pd gt x-2 
f f J (1 +x) y dx dy dz = f f [(« F 5) e| dydz 
| 41 Ji | Ji 1 


2 £25 215 15 1^ 15 
= zeyd az= f ea | d =—(e-e). A 
{f gore 1 4 X |e a 


If a region and its mass density are reflection-symmetric across a plane, then the 
center of mass lies on that plane. For example, in formula (7) for x, if the region and 
mass density are symmetric in the yz plane, then the integrand is odd in x, and so 
X — 0. This kind of use of symmetry is illustrated in the next example. 


Find the center of mass of the hemispherical region W defined by 
the inequalities x? + y? + z? < 1, z > 0. (Assume that the density is unity.) 


SOLUTION By symmetry, the center of mass must lie on the z axis, and 
so X-— y —0. To find Z, we must compute, by formula (7), the numera- 
tor [= Wy zax dy dz. The hemisphere is an elementary region, and thus the 
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integral becomes 


1-y Ji? 
if rpm S piae zdx dy dz. 
i-y9i-z 


Because z is a constant for the x and y integrations, we can remove it from the first 
two integral signs, to obtain 


A -yi-z 
i i dx dy | dz. 
JIZ J—./1-y?= z 


Instead of calculating the inner two integrals explicitly, we observe that they equal the 
double integral ff, dx dy over the disk x? + y? < 1 — z?, considered as an x-simple 
region in the plane. The area of this disk is (1 — z?), and so 


1 1 2 41! 
rer f a1-2)de=n [ € 24 -2[5 2n Z. 


The volume of the hemisphere is x, and so Z = (/4)/(3x) = 2. A 


— Mists 
Itis common knowledge that Archimedes observed the principle of the lever. 
Perhaps less known is that he was also responsible for discovering the 
concepts of center of mass and center of gravity. Only two of his works on 
mechanics have been handed down to us: On Floating Bodies and On the 
Equilibrium and Centers of Mass of Plane Figures. Both were translated into 
Latin by Niccolo Tartaglia, circa 1543. 

In Equilibrium... , Archimedes began the field of applied mathematics, 
doing for mechanics what Euclid had accomplished for geometry. In this 
work he describes the principles behind all the machines of antiquity, 
including the lever, inclined plane, and pulley system. 

Surprisingly, Archimedes never carefully defined the center of mass; the 
first proper definition was given by Pappus of Alexandria in 340 c.E. The 
concept of equilibrium was to have a profound effect on the development of 
mechanical engineering (through the introduction of gears), architecture, 
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and in art, permitting the construction of complex machines, large-scale 
buildings, and sculptures. Figure 6.3.3 shows sketches by Leonardo 
DaVinci, illustrating equilibrium positions of the human body. 


Figure 6.3.3 Equilibrium positions of the human body, to be observed by 
the painter. The center of mass should be supported to maintain equilibrium. 
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Moments of Inertia 


Another important concept in mechanics, one that is needed in studying the dynamics 
of a rotating rigid body, is that of moment of inertia. If the solid W has uniform density 
ô, the moments of inertia I, I,, and I, about the x, y, and z axes, respectively, are 
defined by: 


Moments of Inertia About the Coordinate Axes 


a= ff (9? +27) dx dy dz, i= ff (x? +27) 8 dx dy dz, 
Ww wW 


L «fif (x? + y?) 8 dx dy dz. 
Ww 


(8) 


The moment of inertia measures a body’s response to efforts to rotate it; for example, 
as when one tries to rotate a merry-go-round. The moment of inertia is analogous to 
the mass of a body, which measures its response to efforts to translate it. In contrast to 
translational motion, however, the moments of inertia depend on the shape and not just 
the total mass. It is harder to spin up a large plate than a compact ball of the same mass. 

For example, /, measures the body's response to forces attempting to rotate it 
about the x axis. The factor y? + z?, which is the square of the distance to the x axis, 
weights masses farther away from the rotation axis more heavily. This is in agreement 
with the intuition just explained. 


12.589 H 95) Compute the moment of inertia /, for the solid above the xy plane 
bounded by the paraboloid z = x? + y? and the cylinder x? + y? = a?, assuming a 
and the mass density to be constants. 


SOLUTION The paraboloid and cylinder intersect at the plane z — a?. Using 
cylindrical coordinates, we find from equation (8), 


a (2n pr? a pan pr? 6 
n= | [ f or? -rdzdðdr =6 | [ Í Piire E. « 
o Jo Jo o Jo Jo 3 


Gravitational Fields of Solid Objects 


Another interesting physical application of triple integration is the determination of 
the gravitational fields of solid objects. Example 6, Section 2.6, showed that the 
gravitational force field F(x, y, z) of a particle is the negative of the gradient of a 
function V(x, y, z) called the gravitational potential. If there is a point mass M at 
(x, y, z), then the gravitational potential acting on a mass m at (x1, yi, zi) due to this 
mass is -GmM [(x — x1)? + (y — 1)? + (z — zi Y T2, where G is the universal 
gravitational constant. 
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If our attracting object occupies a domain W with mass density 5(x, y, z), we 
may think of it as made of infinitesimal box-shaped regions with masses dM — 
5(x, y, z) dx dy dz located at points (x, y, z). The total gravitational potential V for 
W is then obtained by “summing” the potentials from the infinitesimal masses. Thus, 
we arrive at the triple integral (see Figure 6.3.4): 


VQ. yi z1) = —Gm JI, E (9) 


xc -n-G-aP 


Gyr) 


Figure 6.3.4 The gravitational potential that produces a force 
acting on a mass m at (xi, yi, 21) arising from the mass dM = 
(x, y, z) dx dy dz at (x, y, z) is -[Gmó(x, y, z) dx dy dz]/r. 
65», 2) 


— teal Noli i 


Let W be a region of constant density and total mass M. Show that 
the gravitational potential is given by 


1 
V(xi, 1,71) = H GMm, 
r av 
where [1/r]ay is the average over W of 


1 
Ve -xP +O -»*-G-a 


foe yz- 


402 The Change of Variables Formula and Applications of Integration 


SOLUTION According to formula (9), 


6dx dy dz 
—V(x1, 91,21) = G 
Db ARD » JJJ, @ P+ -P+ E-z 
dx dy dz 
= Gms ry Tn I 
4 J, Ve — +O -»y + @ — 21? 


iff dx dy dz 
w yx xit -y E-z} 


volume (W) 


= Gm[ volume (W)] 


1 
= GmM H 
r av 


are required. A 


Let us now use formula (9) and spherical coordinates to find the gravitational 
potential V(x), yı, z1) for a region W with constant density between the concentric 
spheres p = p; and p = p», assuming the density is constant. Before evaluating the 
integral in formula (9), we make some observations that will simplify the computation. 
Because G, m, and the density are constants, we may ignore them at first. Because 
the attracting body, W, is symmetric with respect to rotations about the origin, the 
potential V(x;, yı, zi) should itself be symmetric—thus, V(xi, yi, 21) depends only 


onthe distance R = ,/x? + y? +z? from the origin. Our computation will be simplest 


if we look at the point (0, 0, R) on the z axis (see Figure 6.3.5). Thus, we need to 
evaluate the integral 


re, - - fI]. Va a = RY 


(0, 0, R) 


Gp yp zi) 
. Figure 6.3.5 The gravitational potential at (x1, y1, zı) is the same 


as at (0, 0, R), where R = ,/x? + y? 4 zi. 
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In spherical coordinates, W is described by the inequalities oj < p < p2,0 < 0 < 27, 
and 0 < $ < z, and so 


pa n 2x a at 
—V(0, 0, n= f Í [ p“ sind d0 d$ dp l 
gp 0. 0 Je sin? $(cos? 0 + sin? 0) + (p cos — R} 


Replacing cos? 0 + sin? 0 by 1, so that the integrand no longer involves 0, we may 
integrate over Ó to get 


pa m 2c dod, 
=V(0,0, R) = 2x | [ pERQdOdE 
Bi. #0 Je sin? $ + (p cos ó — R}? 


=ar f "oLL8nédó dp 
m 0 Jp? —2Rpcosó + R? ` 


The inner integral over $ may be evaluated using the substitution u = —2 Rp cos $. 
We get 
1 RU 2—1/2 24.2 2y 1/2 did 
mE (o^ +u + R^) "* du = ——(p* +u + R^) 
2Rp J-2np 2Rp -2Rp 


1 
= rid +2Rp + R?)'? — (o? — 2Rp + RD)? 


1 
Rolle + RYT? — Co — RY 17) 
p 
1 
= ga €^ & - Ip Rb. 


The expression p + R is always positive, but p — R may not be, so we must keep the 
absolute value sign. Substituting into the formula for V, we get 


P2 p? Qn 
R R\)dp = 
mer lp D dp R 


V(0,0, R) = 27 f 


p 


pa 
i] p(o+R—|p—Rido. 
Pi 


We consider two possibilities for R, corresponding to the gravitational potential for 
objects outside and inside the hollow ball W. 


Case 1. If R > p [that is, if (x1, 1, zi) is outside W], then |o — R| = R — p for all 
p in the interval [pi, p2], so that 


2x f? 4x [? 1 4r 
-re.0. m =F f pee R-(R- odo =F [doo 53 - wd. 
P 


Pi 


404 


The Change of Variables Formula and Applications of Integration 


The factor (47r/3)(3 — oj) equals the volume of W. Putting back the constants G, m, 
and the mass density, we find that the gravitational potential is — Gm M/ R, where M 
is the mass of W. Thus, V is just as it would be if all the mass of W were concentrated 
at the central point. 


Case 2. If R < p; [that is, if (x1, y1, zi) is inside the hole], then |o — R| = p — R 
for p in [p1, p2], and so 


2 p2 p2 
-V (0,0, R) = (Gm) plp +R- (P = Rdp = (Gmr f oao 


Pı m 


= (Gm2z(pj — pi). 


The result is independent of A, and so the potential V is constant inside the hole. 
Because the gravitational force is minus the gradient of V, we conclude that there is 
no gravitational force inside a uniform hollow planet! 


We leave it to the reader to compute V (0, 0, R) for the case p < R < p». 

A similar argument shows that the gravitational potential outside any spherically 
symmetric body of mass M (even if the density is variable) is V = GMm/R, where 
R is the distance to its center (which is its center of mass). 


DOJDE] Find the gravitational potential acting on a unit mass of a spherical 
star with a mass M = 3.02 x 10% kg at a distance of 2.25 x 10!! m from its center 
(G = 6.67 x 107! N- m?/kg?). 


SOLUTION The negative potential is 


GM . 6.67 x 107!! x 3.02 x 10°° 


$ IY UT = 8.95 x 108 m". a 
B x 


EXERCISES 
1. Find the average of f(x, y) = y sin xy over D = [0, x] x [0, 7x]. 
2. Find the average of f(x, y) = e**" over the triangle with vertices (0, 0), (0, 1), and (1, 0). 
3. Find the center of mass of the region between y = x? and y = x if the density is x + y. 
4. Find the center of mass of the region between y = 0 and y = x?, where 0 < x < i 
5. A sculptured gold plate D is defined by 0 < x < 27 and 0 < y < x (centimeters) and 
has mass density 5(x, y) = y? sin? 4x + 2 (grams per square centimeter). If gold sells for $7 


per gram, how much is the gold in the plate worth? 


6. In Exercise 5, what is the average mass density in grams per square centimeter? 
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7. (a) Find the mass of the box [0, 1] x [0, 1] x [0, 2], assuming the density to be uniform. 
(b) Same as part (a), but with a mass density (x, y, z) = x? + 3y? Tz-l. 


8. Find the mass of the solid bounded by the cylinder x? + y? = 2x and the cone 
z? = x? + y? if the density is 8 = yx? + y?. 


9. Find the center of mass of the region bounded by x + y +z = 2, x = 0, y = 0, and 
z = 0, assuming the density to be uniform. 


10. Find the center of mass of the cylinder x? + y? < 1, 1 < z < 2 if the density is 
ô = (x? ry». 


11. Find the average value of sin? mz cos? rx over the cube [0, 2] x [0, 4] x [0, 6]. 

12. Find the average value of e^ over the ball x? + y? 4- z? < 1. 

13. A solid with constant density is bounded above by the plane z = a and below by the cone 
described in spherical coordinates by $ = k, where k is a constant 0 < k < 7/2. Set up an 


integral for its moment of inertia about the z axis. 


14. Find the moment of inertia around the y axis for the ball x Ty +2? < R? if the mass 
density is a constant ô. 


15. Find the gravitational potential on a mass m of a spherical planet with mass 
M =3 x 10% kg, at a distance of 2 x 10° m from its center. 


16. Find the gravitational force exerted on a 70-kg object at the position in Exercise 15. 


17. A body W in xyz coordinates is called symmetric with respect to a given plane if for 
every particle on one side of the plane there is a particle of equal mass located at its mirror 
image through the plane. 


(a) Discuss the planes of symmetry for an automobile shell. 

(b) Let the plane of symmetry be the xy plane, and denote by W* and W~ the portions 
of W above and below the plane, respectively. By our assumption, the mass density 5(x, y, z) 
satisfies (x, y, —z) = ó(x, y, Z). Justify the following steps: 


I 8(x, y, z) dx dydz = I zó(x, y, z) dx dy dz 
w w 
= fff zœ, 2) dx dyde fff zó(x, y, z) dx dy dz 
w* w- 
= Il zó(x, y, ax dydz+ [ff —wó(u, v, —w)du dv dw 
w+ w+ 
= 0. 


(c) Explain why part (b) proves that if a body is symmetrical with respect to a plane, 
then its center of mass lies in that plane. 
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(d) Derive this law of mechanics: [fa body is symmetric with respect to two planes, then 
its center of mass lies on their line of intersection. 


18. A uniform rectangular steel plate of sides a and b rotates about its center of mass with 
constant angular velocity w. 


(a) The kinetic energy equals 3 (mass)(velocity)?. Argue that the kinetic energy of any 
element of mass 5 dx dy (5 = constant) is given by 5(w?/2)(x? + y?) dx dy, provided the 
origin (0, 0) is placed at the center of mass of the plate. 

(b) Justify the formula for kinetic energy: 


"m 
K.E. =/f 8— (x? + y?) dx dy. 
plate 2 


(c) Evaluate the integral, assuming that the plate is described by the inequalities 
—a/2 xx x a[2, —b[2 < y < b[2. 


19. Asis well known, the density of a typical planet is not constant throughout the planet. 
Assume that planet C.M.W. has a radius of 5 x 105 cm and a mass density (in grams per 
cubic centimeter) 


3 x 10* 


P(X, YZ) = r 
3, r € 104 cm, 


, orz]10'cem, 


where r = yx? + y? + z?. Find a formula for the gravitational potential outside C.M.W. 


6.4 Improper Integrals 


In this section, we study improper integrals—that is, integrals in which the function 
may be unbounded or the region of integration is unbounded. We shall first recall the 
situation for functions of one variable. 


One-Variable Improper Integrals 


In the study of integrals of functions of one variable, one encounters various types of 
“improper” integrals; that is, integrals of unbounded functions defined on intervals 
or integrals of functions over unbounded intervals. For example, 


td e dx 
] ze and f 2 


are improper integrals. They are evaluated using a limiting process; for instance, 


1 1 1 
J pamm, uet mim vel) = ima -2va)=2 
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and 


By o [5 dk _ dla : 1 
i 7 = lim = lim m —-lim[1——-]-21. 
1 x boo J, x? boo x| b->00 b 


If, in such a limiting process, the limit does not exist (or is infinite), we say that the 
integral does not exist (or that the integral diverges). 


Improper Integrals in the Plane 


Next, we describe three types of improper integrals of two variables over a region 
D. The first two types are described in the text below, and the third type (integrals 
over unbounded regions) is left to the exercises. We will evaluate all integrals using 
a limiting process, as in the one-variable case. 

For simplicity of exposition, we first restrict ourselves to nonnegative functions 
f—that is, f(x, y) > 0 for all points (x, y) € D—and to y-simple regions described 
as the set of (x, y) such that 


asx<b, p(x) <y <h), 


as in Figure 6.4.1. 


Y= G(x) 


Figure 6.4.1 A y-simple domain. 


In the first case we wish to consider, let's assume that f: D — R is continuous 
except for points on the boundary of D. Consider, for example, 


1 


where D is the unit disk D = {(x, y)|x? + y? < 1}. Clearly, f is not defined on the 
boundary of D, where x? + y? = 1; yet it will be of practical interest to be able 
to evaluate ff, p J (x, y) dA, because this integral represents the area of the upper 
hemisphere of the unit sphere in three space. 


f(y) = 
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Exhausting Regions 


Our basic idea will be to integrate such an f over a smaller region D’, where we 
know the integral exists, and then let D' “tend” to D; that is, “exhaust” D and see 
if ff, p f dA tends to some limit. With this in mind, we pick a special kind of D’, as 
follows. 

Let 7 > 0 be small enough so that a + n < b-n. Let à > 0 be small enough so 
that $1(x) +8 < $2(x) —8 for all x, a < x < b (see Figure 6.4.2). If $5(x) = pı (x) 
for some x, no such 6 will exist, but we shall worry about this minor issue when it 
arises in our later examples. Then the region 


Dys = (Œ, ya +n <x <b-n and $i) 8 <y € 606) - ô} 


is a subset of D, and as (77, 5) — 0, D,,5 tends to D. 


y Y= 4,0) 


Figure 6.4.2 A shrunken domain D;,; for improper 
integrals. 


Y=) 
G(x) +8 
a atn b-n b 


Improper Integrals as Limits 


Because f is continuous and bounded on D, ;, the integral ff D, ; f d A exists. We 
can now ask what happens as the region D}, expands to fill the region, D—that is, 
as (n, 5) — (0, 0). Provided that 


lim 1 dA 
>00 J Jp, s f 


exists, we say that the integral of f over D is convergent or that f is integrable over 
D, and we define [f^ f dx dy to be equal to this limit. 


DCI eo Evaluate 
1 
—— dA, 
Iv 


where D is the unit square [0, 1] x [0, 1]. 
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SOLUTION D is clearly a y-simple region. Choose 7 > 0 and ô > 0 so that 
Dy, C D, as in Figure 6.4.3. Then, by Fubini's theorem: 


1 l-n f1—-6 1 
i : —— dydx 
ff... V Xy n 8 AX d 


1-n Em 


z^ 3 $^ 


— 3 (4 _ an _ 72) .3 (4 oan _ 523 
X n) n )$ (2) —97 


Figure 6.4.3 The slightly shrunken unit square. 


Letting (7, 5) — (0, 0), we see that 


1 33 
lim JI —dydx=-=--=-. A 
(0) (0.0) J Jp, 5 A/Xy 22 4 


Unfortunately, it may not always be possible to evaluate such limits so directly 
and simply. This is often the case in the most interesting examples, as with the surface 
area of the hemisphere, mentioned earlier. It’s as if the “real world” always presents 
the greatest challenges to the mathematician! So let us expand a bit on our theoretical 
discussion. 


Improper Integrals as Limits of Iterated Integrals 
Suppose f is integrable over D,,5. We can then apply Fubini’s theorem to obtain 


$2(x)—8 
If ma d f(x, y) dy dx. 
Drs at Gi(x)+5 
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Hence, if f is integrable over D, 


b-n  pdr(x)-8 
f 4A — . y)dy dx. 1 
Í Js NM mo, [ NETS (1) 


Now F(n, 5) = Ff Dis f dA is a function of two variables, rj and 5, because as we 
change 7) and ô, we get another number. Now if f is integrable, then 


lim aE „ô 
m (n, ô) = 


exists. It follows that the iterated limits 


um lim F(n,ô) and lim lim F(n, ô) 


also exist and are both equal to L, which in our case is Sh, p f 44. Thus, the iterated 
limit 


$1(x)-6 
lim lim e f(x, y) dy dx 
030920 Jorn Ji o) 


also exists. Conversely, if the iterated limits exist, it does not generally follow that the 


bait din lm o 5) exists. 
1,8 


For enl if it were to turn out in some way that F(n, 8) = nê /(n? + $^), then 
lim lim F(n, 6) = lim Im F(n, 9) — 0; yet í lim (9) does not exist, because 
28) 


1205-0 
F(n, n) = 1/2 des. Section 22); 
In view of this, consider expression (1) again. If f is integrable, then 


42(x)-8 
J f(x, y)dA= lim M S(x,y) dy dx 
D an Jo 


(57 (0,0) TS 


p(x) 
= lim lim Cp f(x, y) dy dx. 
1-5(0)5—0 Ja "m 


Now suppose that for each x, 


(x)-8 
imf fend» 
50 Ji )46 


exists. Denote this by f Pe n) f(x, y) dy. Suppose further that 


b-n porlx) 
lim f f(x, y)dy 


19 Jae Jp) 


also exists. We denote this limit by j ie f(x, y) dy dx. Then ifall limits exist, all 


limits must be equal. Thus, f is integrable and the iterated improper integral exists, 
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then necessarily 


f[ ream PL tenaa. 


However, is it possible that the existence of just the iterated integrals implies the 
integrability of f? We turn to this important question next. 


Fubini's Theorem for Improper Integrals 


For integrals, something truly remarkable happens. Unlike the case for iterated 
limits (as in the counterexample considered earlier), the existence of the iterated 
limits does imply the integrability of f as long as f > 0. Thus, if f > 0 and if 
j J HO f. y)dy dx exists as an iterated limit, then f is integrable and 


dx) 
[[ re. [ ' [^ fa dad 


a Jd 


If D is an x-simple region with the x coordinate lying between two functions yı and 


V», and if 
d p) 
[fl S(x,y) dx dy 
c Jn) 


exists as an improper integral, it again follows that f is integrable and 


ff re »- [ ' Í E fl, y) dx dy. 


All these results, which are the improper analogues of Theorems 4 and 4' in Section 
5.3, are known as Fubini s theorem for improper integrals, which we formally state. 


THEOREM 3: Fubini's Theorem Let D be an elementary region in the 
plane and f > 0 a function continuous except for points possibly on the boundary of 
D. If either of the integrals 


Sf re.» dA, 


b ph) 
f [ f(x, y)dy dx, for y-simple regions 
a 4$) 


d (yaQ) 
f f f(x, y) dx dy for x-simple regions 
c Jyly) 


exist as improper integrals, f is integrable and they are all equal. 
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The proof of this involves advanced concepts of analysis, so we omit it here. This 
result can be quite useful in calculation, as the next example shows. 


DYB Let f(x, y) = 1//1 — x? — y?. Show that f is integrable and 
that [fp f(x, y) dA = 2x, half the surface area of the unit sphere. 
SOLUTION For —1 < x < 1, we have 


[pud dy 14x7-5 dy 


-m = lim ————— 
-VI y1 =x? — y?  $90J 2345 y1 — x? — y? 


r Vi-x?-5 
= lim sin! ( ) | 
B98 V1 —x?/ |- 


5 ô 
= lim fsin™! ( 1 ) sin ( 1+ ) 
fim [sin ( gu: Tz 


= sin^!(1) — sin"! (-1) = =n. 


Clearly, 


im f f "E i L. n dx = limz(2 — 2n) = 27 
= xm - = Zn. 
n>0J-i4n J-V y1 x? y? 0J- n>0 


Thus, f is integrable. To see why this theorem is so useful, try to show directly from 
the definition that f is integrable. It is not easy to do so! 4 


EXAMPL Let f(x, y) = 1/(x — y) and let D be the set of (x, y) satisfying 
0 <x <1and0 < y < x. Show that f is not integrable over D. 


SOLUTION Because the denominator of f is zero on the line y = x, f is un- 
bounded on part of the boundary of D. Let 0 < n < 1 and 0 < ô < n, and let D, 5 
be the set of (x, y) with n € x € 1 — 7 and ô < y <x — ô (Figure 6.4.4). 


Figure 6.4.4 The shrunken domain D, for a triangular domain D. 


Here the region D is y-simple with $i(x) = 0, ¢2(x) = x, and $1(0) = ¢2(0). 
To ensure that D}, C D and is depicted in the figure, we must choose 6 a bit more 
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carefully. A little geometry shows that we should choose 28 « n. Consider 


l-n px-d 1 
Í faa =f [ dy dx 
Dis n ô X—y 
1- 


n 
= f teme» 


"7 


l= 
= Í "Eog (8) + log (x — ô)] dx 
n 
1-5 1-n 
= [- log$] dx + log (x — ô) d 
[- log Í x Í og (x — 8)dx 
= —(1 — 2n) logs + [(x — 8) log (x — 8) — (x — ô)”. 


In the last step, we used the fact that f logu du = u logu — u. Continuing the pre- 
ceding set of qualities, we have 


Í f dA = —(1 — 2n) logs + (1 — n — ô) log (1 — n — 8) 
Dis 


—1(1 — n — 8) — (n — ô) log (n — 8) + (n — 8). 


As (7, 5) —> (0, 0), the second term converges to 1 log 1 = 0, and the third and 
fifth terms converge to —1 and 0, respectively. Let v = n — 5. Because v log v > 0 
as v — 0 (a limit established by using D Hópital's rule from calculus?), we see that 
the fourth term goes to zero as (7,5) — (0, 0). It is the first term that will give us 
trouble. Now: ` 


—(1 — 2n) log ô = — log ô + 2n log ô, (2) 


and it is not hard to see that this does not converge as (7, 5) — (0, 0). For example, 
let ņn = 26; then expression (2) becomes — log ô + 46 log ô. As before, 48 log > 0 
as ô — 0, but — logó — +00 as ô — 0, which shows that expression (2) does not 
converge. Hence, " 2m. " Ip, , f dA does not exist and so f is not integrable. a 


LE] 


Functions Unbounded at Isolated Points 


We now consider nonnegative functions f that become “infinite” or are undefined 
at isolated points in an x-simple or y-simple region D. For example, consider the 
function f(x, y) = 1/,/x? + y? on the unit disk D = ((x, y)|x? + y? < 1). Again, 
f = 0, but f is unbounded and is not defined at the origin. 


?L Hópital's rule was discovered by Bernoulli and was reported in L'Hópital's textbook. 
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Let (xo, yo) be a point of a general region D where a nonnegative function f is 
undefined. Further, let Ds = Ds(xo, yo) be the disk of radius 5 centered at (xo, yo) 
and let D\ Ds denote the region D with Ds removed. Assume that f is continuous at 
every point of D except (xo, yo). Then ff, p\p, f dA is defined. We say that Ff f£. aA 
is convergent, or that f is integrable over D if 


lim f dA 
890 J Jp, f 
exists. 
AAB Show that f(x, y) = 1/./x2 + y? is integrable over the unit disk 


D and evaluate ff, fdA. 


SOLUTION Let D; be the disk of radius ô centered at the origin. Then f is 
continuous everywhere on D except at (0, 0). Thus, ff; D\Ds f dA exists. To evaluate 
this integral, we change variables to polar coordinates, x = r cos6, y = r sin@. Then 
f(r cos0, r sin@) = 1/r, and 


1 pan 1 1 p2x 
If paa= [ l vf ao dr = [ d0 dr = 2n(1 — 8). 
D\Ds & Jo F 8 JO 


Thus, 


[Ir f fdA-2m. à 
D 850 J J yp, 


More generally, one can, in an analogous manner, define the integral of nonnega- 
tive functions f that are continuous except at a finite number of points in D. One can 
also combine both types of improper integrals; that is, one may consider functions that 
are continuous except at a finite number of points on D or at points on the boundary 
of D, and define ff, f dA appropriately. 

If f takes both positive and negative values, one can use a more advanced inte- 
gration theory, called the Lebesgue integral, to generalize the notion of convergent 
integral ff^, f dA. Using this theory, it is possible to show that if [fp f dA exists, it 
can then be evaluated as an iterated integral. This latter fact is also known as Fubini's 
theorem. 


Unbounded Regions 


As was mentioned previously, we will leave consideration of unbounded regions to 
the exercise section. However, we must point out that we have already addressed the 
main idea in Example 5 of Section 6.2 on the Gaussian integral. In that example, we 
integrated exp(—x? — y?) over all of IR? by integrating first over a disk of radius a 
and then letting a — oo. 


6.4 Improper Integrals 415 


EXERCISES 
In Exercises 1 to 4, evaluate the following integrals if they exist (discuss how you define the 
integral if it was not given in the text). 


e ne - 
ie J| As tt [0, 1] x [0, 1] 


1 
2. ——-——. dx dy, where D = {(x, O<x<1lO<y<ly<x 
[labs y 165») y ysx} 
3. [formes dy, where D is bounded by x = 1, x = y, and x = 2y 
D 


-— 
4. f f logx dx dy 
0 J0 


5. (a) Evaluate 


Í / dA 
p (t  y!yn 
where D is the unit disk in R? 


(b) Determine the real numbers À for which the integral 


DE 


is convergent, where again D is the unit disk. 


6. (a) Discuss how you would define ffs f dA if D is an unbounded region, for example, 
the set of (x, y) such that a € x < oo and $i(x) € y < $»(x), where $; < $» are given 
(Figure 6.4.5). 

(b) Evaluate ff, xye-* 9" dx dy ifx 20,05 y <1. 


Figure 6.4.5 An unbounded region D. 


y= oi) 


7. Using Exercise 6, integrate e~” for x > 0,1 < y € 2 in two ways. Assuming Fubini’s 
theorem can be used, show that 


œ yx . e- 
f pe ie log2. 
0 


416 


'The Change of Variables Formula and Applications of Integration 


8. Show that the integral 


[ [ "gu val — yA dy dx 


exists, and compute its value. 


9. Discuss whether the integral 


x+y 
a jedy 
UE iiis 


exists where D = [0, 1] x [0, 1]. If it exists, compute its value. 


10. One can also consider improper integrals of functions that fail to be continuous on entire 
curves lying in some region D. For example, by breaking D = [0, 1] x [0, 1] into two 
regions, define and then discuss the convergence of the integral 


Jr ege 


11. Let W be the first octant of the ball x? + y? +z? < a?, where x > 0, y > 0,z > 0. 
Evaluate the improper integral 


Q +y? +2) 
—>= dx dy dz 
w yz 4 (x? 4 y? +27) 


by changing variables. 


12. Let f be a nonnegative function that may be unbounded and discontinuous on the 
boundary of an elementary region D. Let g be a similar function such that f(x, y) < g(x, y) 
whenever both are defined. Suppose ff, g(x, y) dA exists. Argue informally that this implies 
the existence of ff, f(x, y) dA. 


13. Use Exercise 12 to show that 


I sin'(x — y) dvd 

— dy dx 

pnJl—-x?—y? ý 

exists where D is the unit disk x? + y? < 1. 

14. Let f be as in Exercise 12 and let g be a function such that 0 < g(x, y) < f(x, y) 
whenever both are defined. Suppose that ff, g(x, y) dA does not exist. Argue informally that 


Ip fx. y) dA cannot exist. 


15. Use Exercise 14 to show that 


ect 
ll dydx 
px—y 


does not exist, where D is the set of (x, y) with 0 < x < l and 0 < y <x. 
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16. Let D be the unbounded region defined as the set of (x, y, z) with xi yl +221. By 
making a change of variables, evaluate the improper integral 


I Reeacess | dxdydz _ 
TESTEM 

1 py 1 pl 
Wi Tdxdy and NETT 
o Jo Y 0 Jx Y 


Does Fubini's theorem apply? 


17. Evaluate 


18. In Exercise 11 of Section 5.2 we showed that 


Lh eos [ens 


Thus, Fubini's theorem does not hold here, even thought the iterated improper integrals both 
exist. What went wrong? 


REVIEW EXERCISES FOR CHAPTER 6 


1. (a) Find a linear transformation taking the square S = [0, 1] x [0, 1] to the 
parallelogram P with vertices (0, 0), (2, 0), (1, 2), (3, 2). 
(b) Write down a change of variables formula appropriate to the transformation you 
found in part (a). 
2. (a) Find the image of the square [0, 1] x [0, 1] under the transformation T(x, y) = 
(2x, x + 3y). 
(b) Write down a change of variables formula appropriate to the transformation and the 
region you found in part (a). 
3. Let B be the region in the first quadrant bounded by the curves xy — 1, xy — 3, 
x? — y? = l, and x? — y? = 4. Evaluate ff, (x? + y?) dx dy using the change of variables 


u=x?—y?,v=xy. 


4. In parts (a) to (d), make the indicated change of variables. (Do not evaluate.) 
VOX 2 2/2 
(a) ri T Í (x? + y^)! dx dy dz, cylindrical coordinates 
0-92) 


VO) pla? 
(b) f. f xyz dz dx dy, cylindrical coordinates 


A/ (1-3?) Jaf (4x29) 
e5 Va) 
(c) i J z? dz dx dy, spherical coordinates 
—a/ 2-3?) J Af 624) 


z/A p2m 
(y a [ f p? sin 2¢ dO d dp, rectangular coordinates 
o Jo Jo 
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5. Find the volume inside the surfaces x? + y? = z and x? + y? +2? = 2. 

6. Find the volume enclosed by the cone x? 4- y? — z? and the plane 2z — y — 2 — 0. 

7. ^ cylindrical hole of diameter 1 is bored through a sphere of radius 2. Assuming that the 
axis of the cylinder passes through the center of the sphere, find the volume of the solid that 


remains. 


8. Let Cj and C; be two cylinders of infinite extent, of diameter 2, and with axes on the x 
and y axes, respectively. Find the volume of their intersection, C1 N C2. 


9. Find the volume bounded by x /a + y/b + z/c = 1 and the coordinate planes. 
10. Find the volume determined by z < 6 — x? — y? and z > yx? + y?. 

11. The tetrahedron defined by x > 0, y > 0,z > 0, x +y +z < listo be sliced into n 
segments of equal volume by planes parallel to the plane x + y +z = 1. Where should the 


slices be made? 


12. Let E be the solid ellipsoid E = ((x, y, z) | (x?/a?) + (y?/b?) + (z2/c?) < 1) where 


a > 0,b > 0, andc > 0. Evaluate 
"i xyz dx dy dz 


(a) over the whole ellipsoid; and 
(b) over that part of it in the first quadrant: 


xz0, yz0, and zz0, dd se 


13. Find the volume of the “ice cream cone” defined by the inequalities x? + y? < 1z?, and 


0<2<55+,/5—x?—y?. 


14. Let p, 0, ¢ be spherical coordinates in IR? and suppose that a surface surrounding the 
origin is described by a continuous positive function p = f(0, $). Show that the volume 
enclosed by the surface is 


1 2" 
v=; | [eo sinoaoas. 
3 Jo Jo 
15. Using an appropriate change of variables, evaluate 
[[ oto o asy 
B 


where B is the interior of the triangle with vertices at (0, 0), (0, 1), and (1, 0). 


16. Suppose the density of a solid of radius R is given by (1 + d?) ! where d is the distance 
to the center of the sphere. Find the total mass of the sphere. 
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17. The density of the material of a spherical shell whose inner radius is | m and whose 
outer radius is 2 m is 0.4d? g/cm’, where d is the distance to the center of the sphere in 
meters. Find the total mass of the shell. 


18. Ifthe shell in Exercise 17 were dropped into a large tank of pure water, would it float? 
What if the shell leaked? (Assume that the density of water is exactly 1 g/cm?.) 


19. The temperature at points in the cube C = {(x, y,z)| -1 <x € l, -1 € y < l, and 
—l <z < l}is 32d?, where d is the distance to the origin. 


(a) What is the average temperature? 
(b) At what points of the cube is the temperature equal to the average temperature? 


20. Use cylindrical coordinates to find the center of mass of the region defined by 


PEA œ- +y +2 <l, xl. 


1 
"E 
21. Find the center of mass of the solid hemisphere 

V = {(x, y, z) x? +y? +2 <a’ andz > 0) 
if the density is constant. 


22. Evaluate Ne ery dx dy where B consists of those (x, y) satisfying x? + y? < 1 and 
ys. 


23. Evaluate 


ll dx dy dz 
(x? + y + zyn' 


where S is the solid bounded by the spheres x? + y? + z? = a? and x? + y? + z? = b?, where 
a>b>0. 


24. Evaluate f i (x? + y? + z))xyz dx dy dz over each of the following regions. 
D 
(a) The sphere D = {(x, y, z) | x? + y? +2? < R?] 


(b) The hemisphere D = ((x, y, z) | x? +y? +2? x R? andz > 0} 
(c) Theoctant D = ((x, y, z) | x > 0, y > 0,2 > 0, and z? +y? +2? < R?} 


25. Let C be the cone-shaped region {(x, y, z) | yx? + y? < z < 1} in IR? and evaluate the 


integral [fo + yx? + y?) dx dy dz. 
26. Find nii fx y, z) dx dy dz where f(x, y, z) = exp [7G + y? +277]. 
R 


27. The flexural rigidity EI of a uniform beam is the product of its Young’s modulus of 
elasticity E and the moment of inertia / of the cross section of the beam with respect to a 
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horizontal line / passing through the center of gravity of this cross section. Here 


I = ff. [d(x, y)? dx dy, 


where d(x, y) = the distance from (x, y) to / and R = the cross section of the beam being 
considered. 


(a) Assume that the cross section R is the rectangle —1 < x < 1, —1 < y < 2, and/ is 
the line y = 1/2. Find 7. 

(b) Assume the cross section R is a circle of radius 4 and / is the x axis. Find /, using 
polar coordinates. 


28. Find, fff; f(x, y, z) dx dy dz where 


fe 27 EGR yr nap 


29. Suppose D is the unbounded region of IR? given by the set of (x, y) with 0 < x < oo, 
0 < y x x. Let f(x, y) = x-??2e"-* . Does the improper integral ff, f(x, y) dx dy exist? 


30. If the world were two-dimensional, the laws of physics would predict that the 
gravitational potential of a mass point is proportional to the logarithm of the distance 
from the point. Using polar coordinates, write an integral giving the gravitational 
potential of a disk of constant density. 


31. (a) Evaluate the improper integral 


oo py 7 
f f xe? dxdy. 
o Jo 


(b) Evaluate 


n 2x?y? + y*) dx dy, 
B 


where B is the portion of the disk of radius 2 [centered at (0, 0) in the first quadrant]. 


32. Let f be a nonnegative function on an x-simple or a y-simple region D C IR’, and that is 
continuous except for points on the boundary of D and at most finitely many points interior to 
D. Give a suitable definition of ff, f d A. 


33. Evaluate ff.» f(x, y) dx dy where f(x, y) = 1/(1 +x? + y?) (Hint: You may 
assume that changing variables and Fubini's theorem are valid for improper integrals.) 
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Integrals Over Paths 
and Surfaces 


I hold in fact: (1) That small portions of space are of a nature analogous 
to little hills on a surface which is on the average flat. (2) That this 
property of being curved or distorted is continually passed on from one 
portion of space to another after the manner of a wave. (3) That this 
variation of curvature of space is really what happens in that phenomenon 
which we call the motion of matter whether ponderable or ethereal. (4) 
That in this physical world nothing else tahes place but this variation, 
subject, possibly, to the law of continuity. 


W. K. Clifford (1870) 


n Chapter 5, we studied integration over regions in IR? and R°. In this chapter, 

we study integration over paths and surfaces. This is basic to an understanding of 
Chapter 8, in which we discuss the basic relation between vector differential calculus 
(Chapter 4) and vector integral calculus (this chapter), a relation that generalizes the 
fundamental theorem of calculus to several variables. This generalization is summa- 
rized in the theorems of Green, Gauss, and Stokes. 


7.1 The Path Integral 


This section introduces the concept of a path integral; this is one of the several ways in 
which integrals of functions of one variable can be generalized to functions of several 
variables. Besides those in Chapter 5, there are other generalizations, to be discussed 
in later sections. 

Suppose we are given a scalar function f: IR? — R, so that f sends points in R? 
to real numbers. It will be useful to define the integral of such a function f along a path 
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c: I = [a, b] > R?, where e(t) = (x(t), y(t), z(t)). To relate this notion to something 
tangible, suppose that the image of ¢ represents a wire. We can let f(x, y, z) denote 
the mass density at (x, y, z) and the integral of f will be the total mass of the wire. 
By letting f(x, y, z) indicate temperature, we can also use the integral to determine 
the average temperature along the wire. We first give the formal definition ofthe path 
integral and then, after the following example, further motivate it. 


DEFINITION: Path Integrals The path integral, or the integral of 
f(x, y, z) along the path c, is defined when c: I = [a, b] > R? is of class C! 
and when the composite function f > f(x(t), y(t), z(t)) is continuous on J. We 
define this integral by the equation 


b 
[ras= f re oo een a. 
Sometimes f, f ds is denoted 
f f(x, y, z) ds 
or 
b 
[ feneena: 


If c(t) is only piecewise C! or f(e(t)) is piecewise continuous, we define 
J. f ds by breaking [a, b] into pieces over which f(c(t))||e'(f)|| is continuous, 
and summing the integrals over the pieces. 


When f = 1, we recover the definition of the arc length of c. Also note that f need 
only be defined on the image curve C of c and not necessarily on the whole space in 
order for the preceding definition to make sense. 


DAJAT Let c be the helix e: [0, 277] > R?, t + (cost, sint, t) (see Fig- 
ure 2.4.9), and let f(x, y, z) =x? + y? -- z?. Evaluate the integral f. f(x, y, z) ds. 


SOLUTION First we compute ||e’(¢)||: 


2 + 2 2 
ell = e] $ E + [z] = Vsin? t 4- cos? t - 1 = 2. 


dt dt 
Next, we substitute for x, y, and z in terms of t to obtain 


f. yz) Sx y! +2 cost sit =+? 
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along c. Inserting this information into the definition of the path integral yields 


2n 3727 
[reraa= f (14 Pid = i = 2T 3 44r), A 
€ 0 


To motivate the definition of the path integral, we shall consider “Riemann-like” 
sums Sy in the same general way we did to define arc length in Section 4.2. For 
simplicity, let c be of class C! on I. Subdivide the interval J = [a, b] by means of a 
partition 


tahti «-- «ty —b. 


This leads to a decomposition of c into paths c; (Figure 7.1.1) defined on [¢;, t;+1] for 
0 <i < N —1. Denote the arc length of c; by As;; thus, 


Til 
diis f lle Cl at. 
ti 


€(ty) 


€ (fi+1) 


T 


a= hh ti taa ty=b 


€ 


Figure 7.1.1 Breaking c into smaller c;. 


When N is large, the arc length As; is small and f(x, y, z) is approximately constant 
for points on c;. We consider the sums 


N-1 
Sy — L Jis Yis zi) ASi, 
i=0 


where (xi, yi, zi) = c(t) for some t € [t;, ti+1]. By the mean-value theorem we know 
that As; = [e (£7) Aq, where t; € t* < ti+1 and At; = fi+ı — ti. From the theory of 
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Riemann sums, it can be shown that 


N-1 


limit Sy = limit 2 fesso alan = [ fe 90,201 OL as 


= [se v.20 


The Path Integral for Planar Curves 


An important special case of the path integral occurs when the path c describes a 
plane curve. Suppose that all points c(t) lie in the xy plane and f is a real-valued 
function of two variables. The path integral of f along c is 


b 
[ fe, yds = [ fo), v) x + YOR dt. 


When f(x, y) > 0, this integral has a geometric interpretation as the “area ofa fence.” 
We can construct a “fence” with base the image of c and with height f(x, y) at (x, y) 
(Figure 7.1.2). If e moves only once along the image of c, the integral Je f(x, y)ds 
represents the area of a side of this fence. Readers should try to justify this interpre- 
tation for themselves, using an argument like the one used to justify the arc-length 
formula. 


x (0), y (0) 
e() = (x (0), yO) 


Figure 7.1.2 The path integral as the area 
of a fence. 


Tom Sawyer’s aunt has asked him to whitewash both sides of the 
old fence shown in Figure 7.1.3. Tom estimates that for each 25 ft? of whitewashing 
he lets someone do for him, the willing victim will pay 5 cents. How much can Tom 
hope to earn, assuming his aunt will provide whitewash free of charge? 


SOLUTION  FromFigure 7.1.3, the base ofthe fence in the first quadrant is the path 
c: [0, 7/2] > R?,t  (30cos? t, 30 sin? f), and the height of the fence at (x, y) is 
f(x. y) = 1 + y/3. The area of one side of the half of the fence is equal to the integral 
J. fe y)ds = f1 + y/3) ds. Because e(t) = (—90 cos? t sin t, 90 sin? t cos t), we 


7.1 The Path Integral 425 


Slxy)= 14 3 


Figure 7.1.3 Tom Sawyer’s fence. 


€: t ++ (30 cos? t, 30 sin? 7) 


have |le’(t)|| = 90 sin t cos t. Thus, the integral is 


7/2 3 in? t 
[ 142 as = f j 4298 EV chutccwnati 
c 3 0 3 


m/2 
= » f (sint + 10 sin‘ f) cost dt 
0 


+2 
t 
=| 5 2n] 


m/2 


1 
=90(-+2) =22 
` (5 +2) 5, 


which is the area in the first quadrant. Hence, the area of one side of the fence is 
450 ft^. Because both sides are to be whitewashed, we must multiply by 2 to find the 
total area, which is 900 f£. Dividing by 25 and then multiplying by 5, we find that 
Tom could realize as much as $1.80 for the job. A 


This concludes our study of integration of scalar functions over paths. In the 
next section we shall turn our attention to the integration of vector fields over paths, 
and we shall see many further applications of the path integral in Chapter 8, when we 
study vector analysis. 


Supplement to Section 7.1: 'The Total Curvature of a Curve 


Exercises 12 to 17 of Section 4.2 described the notions of curvature « and torsion t 
ofa smooth curve C in space. If c: [a, b] > C C IR? isa unit-speed parametrization of 
C, so that ||e’(t)|| = 1, then the curvature k(p) at p € C is defined by «(p) = |le”(t)||, 
where p — c(t). A result of differential geometry is that two unit-speed curves with 
the same curvature and torsion can be obtained from one another by a rigid rotation, 
translation, or reflection. 
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The curvature x: C — R is a real-valued function on the set C, so we define the 
total curvature as its path integral over C: f, ck ds. There are some surprising facts 
that mathematicians have been able to prove about the total curvature. For one thing, 
if C is a closed [that is, c(a) = c(5)] planar curve, then 


[ 52m. 
[o 


and equals 27 only when C is a circle. If C is a closed space curve with 


] m 
c 


then C is “unknotted”; that is, C can be continuously deformed (without ever inter- 
secting itself) into a planar circle. Therefore, for knotted curves, 


f 57m 
€ 


See Figure 7.1.4. 


Figure 7.1.4 A knotted curve in R°. 


The formal statement of this fact is known as the Fary—Milnor theorem. Legend 
has it that John Milnor, a contemporary of John Nash's! at Princeton University, was 
asleep in a math class as the professor wrote three unsolved knot theory problems 
on the blackboard. At the end of the class, Milnor (still an undergraduate) woke up 
and, thinking the blackboard problems were assigned as homework, quickly wrote 


! John Nash is the subject of Sylvia Nasar's best-selling biography, A Beautiful Mind, a fictionalized version of which was 
made into a movie in 2001. 
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them down. The following week he turned in the solution to all three problems— 
one of which was a proof of the Fary-Milnor theorem! Some years later, he was 
appointed a professor at Princeton, and in 1962 he was awarded (albeit for other work) 
a Fields medal, mathematics’ highest honor, generally regarded as the mathematical 
Nobel Prize. 


EXERCISES 
1. Let f(x, y, z) = y and e(t) = (0, 0, 1), 0 < t < 1. Prove that f. f ds = 0. 


2. Evaluate the following path integrals I fŒ, y, z) ds, where 
(a) f(x, y,z)=x+y+zande: tb (sint, cost, t), t € [0,27] 
(b) f(x, y, z) = cosz, c as in part (a) 
3. Evaluate the following path integrals f. f(x, y, z) ds, where 
(a) f(x, y, z) = exp Jz, and c: t — (1, 2, (2), t € [0, 1] 
(b) f(x, y, z) = yz, and c: t > (t, 3t, 2t), t € [1,3] 
4. Evaluate the integral of f(x, y, z) along the path c, where 
(a) f(x. y. z) = x cosz, eit e» ti + (7j, t € [0, 1] 


(b) f(x, y, z) = (x + y)/(y + z), and e: t > (n aee, 2m € [1,2] 


5. Let f: IR*\{xz plane} — R be defined by f(x, y, z) = 1/5?. Evaluate f, f(x, y, z) ds, 
where c: [1, e] > IR? is given by c(t) = (log t)i + tj + 2k. 


6. (a) Show that the path integral of f(x, y) along a path given in polar coordinates by 
r —r(0)0; <0 x hy, is 


D 


dr V? 
f(e cos0, r sin) |r? + (£) do. 


[7] 
(b) Compute the arc length of the path r = 1 + cos6,0 < 0 < 27. 


7. Let f(x, y) = 2x — y, and consider the path x = t4, y = 15, —1 <t < 1. 


(a) Compute the integral of f along this path and interpret the answer geometrically. 
(b) Evaluate the arc-length function s(¢) and redo part (a) in terms of s (you may wish to 
consult Exercise 2, Section 4.2). 


Exercises 8 to 11 are concerned with the application of the path integral to the problem of 
defining the average value of a scalar function along a path. Define the number 


f. fe y. z)ds 
Ke) 
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to be the average value of f along c. Here l(c) is the length of the path: 


Ke) = [ lle cl dt. 


(This is analogous to the average of a function over a region defined in Section 6.3.) 


8. (a) Justify the formula [ f. f(x, y, z) ds]/ (e) for the average value of f along c using 
Riemann sums. 
(b) Show that the average value of f along c in Example | is (1 + 1x? 
(c) In Exercise 2(a) and (b) above, find the average value of f over the given curves. 


9. Find the average y coordinate of the points on the semicircle parametrized by 
c: [0, 7] > R?, 0 + (0, asin6,acos0);a > 0. 


10. Suppose the semicircle in Exercise 9 is made of a wire with a uniform density of 2 grams 
per unit length. 


(a) What is the total mass of the wire? 
(b) Where is the center of mass of this configuration of wire? (Consult Section 6.3.) 


11. Letc be the path given by e(t) = (72, t, 3) for t € [0, 1]. 


(a) Find /(c), the length of the path. 
(b) Find the average y coordinate along the path c. 


12. If f: [a, b] > R is piecewise continuously differentiable, let the length of the graph of 
f on [a, b] be defined as the length of the path t — (t, f(t) for t € [a, b]. 


(a) Show that the length of the graph of f on [a, b] is 


b 
f V1 +[/' ŒP dx. 


(b) Find the length of the graph of y = log x from x = 1 to x = 2. 


13. Find the mass of a wire formed by the intersection of the sphere x? + y? +z? = 1 and 
the plane x + y + z = 0 if the density at (x, y, z) is given by p(x, y, z) = x? grams per unit 
length of wire. 


14. Evaluate f. f ds where f(x, y, z) = z and e(t) = (t cost, t sint, t) for O < t < to. 


15. Write the following limit as a path integral of f(x, y, z) — xy over some path c on [0, 1] 
and evaluate: 


N-1 
on 2 2 2 
ptd (i-e). 


i= 


where fj, ..., ty is a partition of [0, 1]. 
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16. Consider paths that connect the points A = (0, 1) and B = (1, 0) in the xy plane, as in 
Figure 7.1.5.2 


4g!) 


Circular path Figure 7.1.5 A curve joining the points A and B. 


Path 


Galileo contemplated the following question: does a bead falling under the influence of 
gravity from a point A to a point B along a curve do so in the least possible time if that curve 
is a circular arc? For any given path, the time of transit 7 is a path integral 


dt 
r=[ 
v 
where the bead's velocity is v = y 2gy, where g is the gravitational constant. In 1697, 
Johann Bernoulli challenged the mathematical world to find the path in which the bead 
would roll from A to B in the least time. This solution would determine whether Galileo's 
considerations had been correct. 
(a) Calculate 7 for the straight-line path y = 1 — x. 
(b) Write a formula for T for Galileo's circular path, given by (x — 1)? +(y — 1} = 1. 


Incidentally, Newton was the first to send his solution [which turned out to be a cycloid—the 
same curve (inverted) that we studied in Example 2.4.4], but he did so anonymously. 
Bernoulli was not fooled, however. When he received the solution, he immediately knew its 
author, exclaiming, “I know the Lion from his paw.” While the solution of this problem is a 
cycloid, it is known in the literature as the brachistrochrone. This was the beginning of the 
important field called the calculus of variations. 


7.2 Line Integrals 


We now consider the problem of integrating a vector field along a path. We will begin 
by considering the notion of work to motivate the general definition. 


Work Done by Force Fields 


If F is a force field in space, then a test particle (for example, a small unit charge in an 
electric force field or a unit mass in a gravitational field) will experience the force F. 


? We thank Tanya Leise for suggesting this exercise. 
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Suppose the particle moves along the image of a path c while being acted upon by 
F. A fundamental concept is the work done by F on the particle as it traces out the 
path c. If c is a straight-line displacement given by the vector d and if F is a constant 
force, then the work done by F in moving the particle along the path is the dot 
product F - d: 


F - d = (magnitude of force) x (displacement in direction of force). 


If the path is curved, we can imagine that it is made up of a succession of infinitesimal 
straight-line displacements or that it is approximated by a finite number of straight- 
line displacements. Then (as in our derivation of the formulas for the path integral in 
the preceding section) we are led to the following formula for the work done by the 
force field F on a particle moving along a path e: [a, b] > IR?: 


b 
work done by F — f F(c(t))- e(t) dt. 


We can further justify this derivation as follows. As ¢ ranges over a small interval 
t to t+ At, the particle moves from e(ż) to c(t + At), a vector displacement of 
As = c(t + At) — c(t) (see Figure 7.2.1). 


N 


c(b) 


e——e——e—-———— —e ‘As = e(t + At) — e(t) 
a t t+At b 


Figure 7.2.1 For small At, As = c(t + At) — c(t) © e'(t)At. 


From the definition of the derivative, we get the approximation As ~ e'(r)At. 
The work done in going from c(t) to c(t + At) is therefore approximately 


F(c(1)) - As © F(c(t)) - c(t) At. 
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If we subdivide the interval [a, b] into n equal parts a = tg < tj < +++ < t, = b, with 
At = t;41 — tj, then the work done by F is approximately 


n-i n-l 


D Fett?) As © 5 ^ F(e(tj)) At. 
i=0 i=0 


As n — oo, this approximation becomes better and better, and so it is reasonable to 
take as our definition of work to be the limit of the sum just given as n — oo. This 
limit is given by the integral 


b 
[ F(c(t)) - e'(r) dt. 


Definition of the Line Integral 


The previous discussion of work motivates the following definition. 


DEFINITION: Line Integrals Let F be a vector field on IR? that is con- 
tinuous on the C! path e: [a, b] > IR?. We define fs F - ds, the line integral of 
F along c, by the formula 


F-ds= " Fett) e(t; 
pron} 


that is, we integrate the dot product of F with c' over the interval [a, b]. 
As is the case with scalar functions, we can also define f F-ds if 
F(e(t)) - e'(t) is only piecewise continuous. 


For paths c that satisfy e’(¢) 4 0, there is another useful formula for the line 
integral: Namely, if T(t) = e’(t)/||e’(t)|| denotes the unit tangent vector, we have 


b 
"EI F(c(t)) - e(t) dt (by definition) 


a 


= [ ° [Few Av, teconae (canceling e(l) — (1) 
1 OI 


b 
-[ [F(c(t)) -TONO at. 
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This formula says that f, F - ds is equal to something that looks like the path integral of 
the tangential component F(c(t)) - T(t) of F along c. In fact, the last part of formula (1) 
is analogous to the path integral of a scalar function f along c.? 

To compute a line integral in any particular case, one can either use the origi- 
nal definition or integrate the tangential component of F along c, as prescribed by 
formula (1), whichever is easier or more appropriate. 


DONYO Let c(t) = (sint, cost, t) with 0 < t < 27. Let the vector field F 
be defined by F(x, y, z) = xi + yj + zk. Compute f, F - ds. 


SOLUTION Here, F(e(t)) = F(sin t, cos f, t) — (sin t)i + (cos t)j + tk, and e(t) = 
(cos t)i — (sin t)j + k. Therefore, 


F(c(t)) - c(t) = sint cost — cost sint +t — t, 


2r 
[r-as= f tdt 2227. A 
c 0 


Another common way of writing line integrals is 


and so 


frais= [5 dx + Fy dy + F; dz, 
c e 


where F1, F5, and F; are the components of the vector field F. We call the expression 
Fi dx + F dy + F; dz adifferential form.‘ By definition, the integral of a differential 
form along a path e, where c(t) = (x(t), y(t), z(t)), is 


2 dx dy dz 
Fi dx + F; + Fdz = = - ds. 
[ | dx + Fy) dy + F; dz [ (AG tag +A) a [7 ds 


Note that we may think of ds as the differential form ds = dxi+ dyj + dzk. 
Thus, the differential form F| dx + F; dy + F3dz may be written as the dot product 
F-ds. 


POW ISHS) Evaluate the line integral 


[eas +xydy+ dz, 


c 


where e: [0, 1] > IR? is given by e(t) = (t, t?, 1) = (x(t), y(t), z(t)). 


3If c does not intersect itself (that is, if c(t;) = c(t?) implies tj = 12), then each point P on C (the image curve of c) can 
be written uniquely as c(t) for some t. If we define f(P) = f(c(t)) = F(c) + T(t), f is a function on C; by definition, its 
path integral along c is given by formula (1) and there is no difficulty in literally interpreting f, F - ds as a path integral. 
If c intersects itself, we cannot define f as a function on C as before (why?); however, in this case it is still useful to think 
of the right side of formula (1) as a path integral. 

4See Section 8.6 for a brief discussion of the general theory of differential forms. 
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SOLUTION We compute dx/dt = 1, dy/dt = 2t, dz/dt = 0; therefore, 


1 
[? dx +xydy + dz = { (wor Z + [xOy] 23 dt 


: are oe] M 
= | (-x2*5at-|l)4l76| ==. 
f (P 4-26) [; "sls 
DIBE] Evaluate the line integral 


[9 + e dy + e dz, 


e 
where the path c is defined by e(t) = (1, t, e') and 0 < t <2. 
SOLUTION We compute dx/dt = 0, dy/dt = 1, dz/dt = e', and so 


2 
[mets te dy +e'dz f (O+e+e)dt 
c 0 


lar 1 1 
= le + al = 2e+ z“ a a 
ID. CVNZISESES Let c be the path 
1 
x—cos0, y-smó, z=0, 0<0<— 


Figure 7.2.2 The image of the path 
x = cos! 0, y = si 0, z = 0;0 < 0 < 77/2. 
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SOLUTION In this case, we have 


dx 254 dy 22 
——— sin@ — = 3s SO, Ta 75 dy 
d6 3cos^ 0 sin 0, 46 3sin^ 0 cos PF 1 


so the integral is 


[sinzax + cosz dy — (xy)? dz 


€ 
Tn [2 
= f (—3 cos? 0 sin? 6 + 3 sin? 0 cos? 0 — cos @ sin 0) dé. 
0 


The first two terms cancel, and so we get 


17/2 I 17/2 1 
-f cos@ sin odo = - |; sino =--. A 
0 2 0 2 


DOB ESE] Suppose F is the force vector field F(x, y, z) = x?i + yj 4- zk. 


Parametrize the circle of radius a in the yz plane by letting c(0) have components 


x= 0) y =acosé, z —asin6, 0x0 x2m. 


Because F(c(0)) - c'(0) = 0, the force field F is normal to the circle at every point 
on the circle, so F will not do any work on a particle moving along the circle 
(Figure 7.2.3). 


Figure 7.2.3 A vector field F normal to a circle in the 
yz plane. 


We can verify by direct computation that the work done by F is zero: 


w= [ras [2s + ydy + zdz 
c c 


2x 
= (0 — a? cos@sin@ + a? cos0 sin@)d0=0. 4 
0 


un 
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WISE S65) If we consider the field and curve of Example 4, we see that the 
work done by the field is —1, a negative quantity. This means that the field impedes 
movement along the path. A 


Reparametrizations 


The line integral f. F-ds depends not only on the field F but also on the path 
c: [a, b] > R°. In general, if e; and c? are two different paths in IR?, Js F- ds £ 
Ja, F + ds. On the other hand, we shall see that it is true that f, F - ds = + f, F - ds for 
every vector field F if cı is what we call a reparametrization of c»; roughly speaking, 
this means that e; and c» are different descriptions of the same geometric curve. 


DEFINITION Leth: I > J, be a C! real-valued function that is a one-to-one 
map of an interval J = [a, b] onto another interval J; = [ai, bi]. Let e: J) > IR? be 
a piecewise C! path. Then we call the composition 


p-coh: I> R 
a reparametrization of c. 


This means that p(¢) = e(A(¢)), and so A changes the variable; alternatively, one 
can think of h as changing the speed at which a point moves along the path. Indeed, 
observe that p'(t) = e’(A(t))A’(t), so that the velocity vector for p equals that for c 
but is multiplied by the scalar factor h’(t). 

It is implicit in the definition that // must carry endpoints to endpoints; that is, 
either h(a) = a, and h(b) = bi, or h(a) = bı and h(b) = aj. We thus distinguish two 
types of reparametrizations. If c o A is a reparametrization of c, then either 


(cohYa)-— ea) and — (coh)(b) = c(bi) 
or 
(co h)(a) = c(bi) and (c o h)(b) = c(a). 


In the first case, the reparametrization is said to be orientation-preserving, and a 
particle tracing the path c o A moves in the same direction as a particle tracing c. In 
the second case, the reparametrization is described as orientation-reversing, and a 
particle tracing the path c o h moves in the opposite direction to that of a particle 
tracing c (Figure 7.2.4). 

For example, if C is the image of a path c, as shown in Figure 7.2.5, that is, 
C = c([ai, bi]), and if h is orientation-preserving, then c o A(t) will go from e(a) 
to c(51) as t goes from a to b; and if A is orientation-reversing, c o ^(f) will go from 
€(bi) to c(ai) as t goes from a to b. 
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a b 


Graph of h Graph of h 


a 8 — — — — — —e a $—— —— ————9 bh 


ERWP S 


a, ——————————9 b QO, 
h preserves orientation h reverses orientation 


(a) (5) 


Figure 7.2.4 Illustrating (a) an orientation-preserving reparametrization, and (b) an 
orientation-reversing reparametrization. 


p(b) - e(5i) 


e(h()) = p(t) 


ey 


Figure 7.2.5 The path p=coh 


aye) is a reparametrization of c. 
a)= 1 


a t b a h(t) bi 
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SAN] Lete: (a, b] > R? bea piecewise C! path. Then: 

(a) The path cop: [a, b] > R2, t +> e(a +b — t), is reparametrization of c corre- 
sponding to the map A: [a, b] — [a, b], t — a +b — t; we call cop the opposite 
path to c. This reparametrization is orientation-reversing. 


(b) The path p: [0, 1] > R°, t  c(a + (b — a)t), is an orientation-preserving 


reparametrization of c corresponding to a change of coordinates h: [0, 1] > 
[a,b], t> a - (Pb— ay. à 


THEOREM 1: Change of Parametrization for Line Integrals 
Let F be a vector field continuous on the C! path c: [a;, b1] > IR?, and let 
p: [a, b] — IR? be a reparametrization of c. If p is orientation-preserving, then 


[r5 [rs 
P c 


and if p is orientation-reversing, then 


[r5 - [ras 
P c 


PROOF By hypothesis, we have a map / such that p = c o h. By the chain rule, 
P(t) = hh (A), 
and so 


b 
f F -ds = f [F(e(A(£))) - e'(h())]h (0) dt. 
p a 


Changing variables with s = h(t), this becomes 


Lu 
F(c(5)) - e'(s) ds 
(a) 


E preserving 


-f F(c(s))-c'(s)ds = fr- ds if p is orientation- 


F(c(s))-e'(s)ds = — f F- ds if p is orientation- 
j reversing. E 
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Theorem 1 also holds for piecewise C! paths, as may be seen by breaking up the 
intervals into segments on which the paths are of class C! and summing the integrals 
over the separate intervals. 

Thus, if it is convenient to reparametrize a path when evaluating an integral, 
Theorem | assures us that the value of the integral will not be affected, except possibly 
for the sign, depending on the orientation. 


DGVB] Let F(x, y, z) = yzi + xzj - xyk and e: [—5, 10] — R? be de- 
fined by t (t, £?, t’). Evaluate f. F -ds and f. F-ds. 


SOLUTION For the path c, we have dx/dt =1, dy/dt =2t, dz/dt =3t?, and 
F(c(t)) = 9i + tj + 2k. Therefore, 


10 d. d d. 10 
[r7 [ FS qm = 4) as f (P + 215 4-365) dt = [t6]!% = 984, 375. 
c fe dt dt dt -5 


5 


On the other hand, for 


Cop: [—5, 10] > R3, £ > e(5 — t) = (5 — t, (5 — t}, (5 — t), 


we have dx/dt = —l, dy/dt = —10 + 2t = —2(5 — t), dz/dt = —75 + 30t 
3t? = —3(5 — t), and F(Cop(t)) = (5 — ti + (5 — t)tj + (5 — t)°k. Therefore, 


10 
[ F-ds [-(5 — t — 2(5 — t — 3(5 — t]dt = [(5 — t5], = —984,375. A 
Cop -5 


We are interested in reparametrizations, because if the image of a particular c can 
be represented in many ways, we want to be sure that path and line integrals depend 
only on the image curve and not on the particular parametrization. For example, for 
some problems the unit circle may be conveniently represented by the map p given 
by 


x(t) = cos 2t, y(t) = sin2t, O<t<nz. 


Theorem | guarantees that any integral computed for this representation will be the 
same as when we represent the circle by the map c given by 


x(t) — cost, y(t) = sint, 0<t<2z, 


7.2 Line Integrals 439 


because p=coh, where A(t) — 2t, and thus p is an orientation-preserving 
reparametrization of c. However, notice that the map y given by 


y(t) = (cost, sint), | 0t <4r 


is not a reparametrization of c. Although it traces out the same image (the circle), it 
does so twice. (Why does this imply that y is not a reparametrization of c?) 

The line integral is an oriented integral, in that a change of sign occurs (as 
we have seen in Theorem 1) if the orientation of the curve is reversed. The path 
integral does not have this property. This follows from the fact that changing t to —t 
(reversing orientation) just changes the sign of c'(r), not its length. This is one of the 
differences between line and path integrals. The following theorem, which is proved 
by the same method as Theorem 1, shows that path integrals are unchanged under 
reparametrizations—even orientation-reversing ones. 


THEOREM 2: Change of Parametrization for Path Integrals 
Let c be piecewise C, let f be a continuous (real-valued) function on the image 
of c, and let p be any reparametrization of c. Then 


[res = [6 ds Q) 
c P 


Line Integrals of Gradient Fields 


We next consider a useful technique for evaluating certain types of line integrals. 
Recall that a vector field F is a gradient vector field if F = Vf for some real-valued 
function f. Thus, 


_af., af. , ar 


F=—i+—j+—k. 
irap tz 


E 
Suppose g and G are real-valued continuous functions defined on a closed interval 


[a, b], that G is differentiable on (a, b), and that G' — g. Then by the fundamental 
theorem of calculus 


b 
[ g(x)dx = G(b) — Ga). 
Thus, the value of the integral of g depends only on the value of G at the endpoints 
of the interval [a, b]. Because Vf represents the derivative of f, one can ask whether 
I Vf -ds is completely determined by the value of f at the endpoints c(a) and c(5). 
The answer is contained in the following generalization of the fundamental theorem 
of calculus. 
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THEOREM 3: Line Integrals of Gradient Vector Fields Suppose 
that f: R? — R is of class C! and that c: [a, b] — IR? is a piecewise C! path. 
Then 


[Vr-t feto - fee. 


PROOF Apply the chain rule to the composite function 
F: t> f(D) 
to obtain 
F'(t) = (f oey(t) = Vf(c(t))- c). 


The function F is a real-valued function of the variable t, and so, by the fundamental 
theorem of single-variable calculus, 


b 
I F'(t)dt = F(b) — F(a) = f(e(b)) — f(c(a)). 


a 


Therefore, 


b b 
f Vf-ds= Y VAC) e) dt = ] F'(t)dt = F(b) — F(a) 
= f(e(b)) — f(e(a). m 
Let c be the path c(t) = (t*/4, sin' (17/2), 0), t € [0, 1]. Evaluate 


he +xdy 
€ 


(which means f. y dx +x dy + 0 dz). 


SOLUTION We recognize y dx +x dy, or equivalently, the vector field yi + 
xj 4- Ok, as the gradient of the function f(x, y, z) — xy. Thus, 


1 1 
[rx eit fea» f(0)— ;-1-0—1. A 


Obviously, if one can recognize the integrand as a gradient, then evaluation of 
the integral becomes much easier. For example, the reader should try to work out the 
integral in Example 9 directly. In one-variable calculus, every integral is, in principle, 
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obtainable by finding an antiderivative. For vector fields, however, this is not always 
true, because a given vector field need not be a gradient. This point will be examined 
in detail in Section 8.3, where we derive a test to determine when a vector field F is 
a gradient; that is, when F — Vf for some f. 


Line Integrals Over Geometric Curves 


We have seen how to define path integrals (integrals of scalar functions) and line 
integrals (integrals of vector functions) over parametrized curves. We have also seen 
that our work is simplified if we make a judicious choice of parametrization. Because 
these integrals are independent ofthe parametrization (except possibly for the sign), it 
seems natural to express the theory in a way that is independent of the parametrization, 
and that is thereby more “geometric.” We do this briefly and somewhat informally in 
the following discussion. 


DEFINITION We define a simple curve C to be the image of a piecewise C! map 
c: Z — R? that is one-to-one onan interval 7; c is called a parametrization of C. Thus, 
a simple curve is one that does not intersect itself (Figure 7.2.6). If J = [a, b], we 
call c(a) and c(5) endpoints of the curve. 


e(a) e(a) 
c y c d 
4$— — —e. ^. 
a b a b 


c(b) c(b) 


Simple curve Not a simple curve 


Figure 7.2.6 A simple curve that has no self-intersections is shown on the left. On 
the right is a curve with a self-intersection, so it is not simple. 


Each simple curve C has two orientations or directions associated with it. If P and Q 
are the endpoints of the curve, then we can consider C as directed either from P to 
Q or from Q to P. The simple curve C together with a sense of direction is called an 
oriented simple curve or directed simple curve (Figure 7.2.7). 


Q 
p m p Q Figure 7.2.7 There are two possible senses of direction 
er on a curve joining P and Q. 
P 
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DEFINITION: Simple Closed Curves By a simple closed curve we mean 
the image of a piecewise C! map e: [a, b] — IR? that is one-to-one on [a, b) and 
satisfies c(a) = c(b) (Figure 7.2.8). If c satisfies the condition e(a) = c(5), but 


is not necessarily one-to-one on [a, 5), we call its image a closed curve. Simple 
closed curves have two orientations, corresponding to the two possible directions 
of motion along the curve (Figure 7.2.9). 


e(a) = e(b) 
Figure 7.2.8 A simple closed curve 


(left) and a closed curve that is not 
simple (right). 


[^ C 
Figure 7.2.9 Two possible orientations for a 
simple closed curve C. 


If C is an oriented simple curve or an oriented simple closed curve, we may 
unambiguously define line integrals along them. 


Line Integrals and Path Integrals Over Oriented Simple Curves and 
Simple Closed Curves C: 


[ro [r5 and ] tas= ffas, (3) 


where c is any orientation-preserving parametrization of C. 


These integrals do not depend on the choice of c as long as c is one-to-one 
(except possibly at the endpoints) by virtue of Theorems 1 and 2.? The point we want 
to make here is that a/though a curve must be parametrized to make integration along 
it tractable, it is not necessary to include the parametrization in our notation for the 
integral. 


5We have not proved that any two one-to-one paths c and p with the same image must be reparametrizations of each other, 
but this technical point will be omitted. 
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If J = [a, b] is a closed interval on the x axis, then 7, as a curve, 

has two orientations: one corresponding to motion from a to b (left to right) and the 
other corresponding to motion from P to a (right to left). If f is a real-valued function 
continuous on /, then denoting / with the first orientation by /+ and J with the second 
orientation by / ^, we have 


ja f(x)dx = n - -f fG)dx = - f. füide. a 


A given simple closed curve can be parametrized in many different ways. Figure 
7.2.10 shows C represented as the image of a map p, with p(ż) progressing in a 
prescribed direction around an oriented curve C as t ranges from a to b. Note that 
p'(t) points in this direction also. The speed with which we traverse C may vary from 
parametrization to parametrization, but as long as the orientation is preserved, the 
integral will not, according to Theorems | and 2. 


P(t) 
p(s) 
S Figure 7.2.10 As t goes from a to b, p(t) 
V/ pla) = p(b) pt) moves around the curve C in some fixed 
direction. 


po) p'(to) 
eee o o o 


att hhb 


The following precaution should be noted in regard to these remarks. It is possible 
to have two mappings c and p with the same image, and inducing the same orientation 


on the image, such that 
[r2 [ras 
c P 


For an example, let c(t) = (cos t, sin t, 0) and p(t) = (cos 2t, sin 2t, 0), 0 < t < 27, 
with F(x, y, z) — (y, 0, 0). Then Fi(x, y, z) 2 y, Fo(x, y, z) 2 0, and FY(x, y, z)=0, 


so 
2x d. 2x 
fra- f Fi(e(t) 57 dt =- Í sin! tdt = —r. 
0 t 0 


c 


But J; F-ds =-2 Jus sin? 2t dt = —2z. Clearly, c and p have the same image, 
namely, the unit circle in the xy plane. Moreover, they traverse the unit circle in 
the same direction; yet I F:ds £ di F - ds. The reason for this is that c is one-to-one, 
but p is not (p traverses the unit circle twice in a counterclockwise direction); there- 
fore, p is not a parametrization of the unit circle as a simple closed curve. 
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As a consequence of Theorem 1 and generalizing the notation in Example 10, 
we introduce the following convention: 


Line Integrals Over Curves with Opposite Orientations Let C^ be the 
same curve as C, but with the opposite orientation. Then 


[r--[ F-ds. 
e is 


We also have: 


Line Integrals Over Curves Consisting of Several Components Let C be 
an oriented curve that is made up of several oriented component curves C;, i = 
l,...,k, as in Figure 7.2.11. Then we shall write C = Ci + C2 +--+ + Cj. 
Because we can parametrize C by parametrizing the pieces C1, . . . , C, separately, 
one can prove that 


fras] Feds+ [ rase | F. ds. (4) 
[o [e] C Cr 


C3 


C; Figure 7.2.11 A curve can be made up of several 
components. 


C 


One reason for writing a curve as a sum of components is that it may be easier to 
parametrize the components C; individually than it is to parametrize C as a whole. If 
that is the case, formula (4) provides a convenient way of evaluating fo F- ds. 


The dr Notation for Line Integrals 


Sometimes one writes, as we occasionally do later, the line integral using the notation 


[ra 
ic 


The reason is that we think of describing a C! path c in terms of a moving position 
vector based at the origin and ending at the point c(f) at time t. Position vectors are 
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often denoted by r= xi + yj + zk, and so the curve is described using the notation 
r(t) — x(t)i + y(Oj 4- z(t)k in place of c(t). By definition, the line integral is given by 


b 
d 
f F(r())- ^ dt. 


Formally canceling the d/'s, and using the parametrization independence to replace 
the limits of integration with the geometric curve C, we arrive at the notation 
fc F- dr. 

[s 


5955-7) Consider C, the perimeter of the unit square in R?, oriented in 
the counterclockwise sense (see Figure 7.2.12). Evaluate the line integral 


[ x? dx +xydy. 
c 


Figure 7.2.12 The perimeter of the unit square, 
parametrized in four pieces. 


SOLUTION We evaluate the integral using a convenient parametrization of C 
that induces the given orientation. For example: 


(t, 0) sti 
; 2 (1,1 — I) St< 
EAR, reja ny 22123 
(0,4 — 1) 3<t<4. 


Then 


1 2 
fria f Eod [ [0 4- (t — 1)]dt 
G 0 1 


3 4 
(3 — 4 
«ft (3 — ty «oar | 0-04 
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Now let us reevaluate this line integral, using formula (4) and parametrizing the 
C; separately. Notice that C = C, + C2 + C3 + C4, where C; are the oriented curves 
pictured in Figure 7.2.12. These can be parametrized as follows: 


Ci: 6(f) = (t,0),0<t <1 
C»: e(t)=(1,t),0<¢ <1 
Cy: en =(1—t,1),0<t <1 
C4: e4(t) = (0,1 —1),0<¢ <1, 


and so 


1 
f 2e» -[ dt =- 
e 0 3 


1 
1 
[ Pax tayay= f tdt =-= 
fon 0 2 


1 

1 
ELSE -(1—tf dt = —- 
C3 0 3 


1 
[ x! dx +xydy = [ Odt — 0. 
Cs 0 
Thus, again, 


ti 
2 =- — a 
f> dx+xydy=3 t7 3 P 


An interesting application ofthe line integral is the mathematical 
formulation of Ampére’s law, which relates electric currents to their magnetic effects.^ 
Suppose H denotes a magnetic field in IR*, and let C be a closed oriented curve in 
R3. In appropriate physical units, Ampére’s law states that 


] mer 
fo 


where / is the net current that passes through any surface bounded by C (see 
Figure 7.2.13). 4 


Finally, let us mention that the line integral has another important physical mean- 
ing, specifically, the interpretation of fọ V - ds as circulation, where V is the velocity 


©The discovery that electric currents produce magnetic effects was made by Haas Christian Oersted circa 1820. See any 
elementary physics text for discussions of the physical basis of these ideas. 


7.2 Line Integrals 447 


Figure 7.2.13 The magnetic field H surrounding a 
wire carrying a current J satisfies Ampére’s law: 
f; H- ds =I. 


Current 7 


field of a fluid, as we shall discuss in Section 8.2. Thus, a wide variety of physical 
concepts, from the notion of work to electromagnetic fields and the motions of fluids, 
can be analyzed with the help of line integrals. 


EXERCISES 
1. Let F(x, y, z) = xi + yj + zk. Evaluate the integral of F along each of the following 


paths: 
(a) e(t) — (t, t, t), 0xrxl (c) e(t) = (sint, 0, cost), O0<t<2n 
(b) c(t) = (cost, sint,0), | 0: «2x (d) e(t)=(t?,3t,20), —-l<t<2 


2. Evaluate each of the following line integrals: 


(a) f.xdy —ydx, c(t) = (cost, sin t), 0xt-x2n 

(b) f.xdx--ydy, — c(t) 2 (coszt,sinzt), 01-2 

(c) f.yzdx 4 xz dy + xy dz, where c consists of straight-line segments joining 
(1, 0, 0) to (0, 1, 0) to (0, 0, 1) 

(d) f x? dx — xy dy + dz, where c is the parabola z = x?, y = 0 from (—1, 0, 1) to 
(1,0, 1). 


3. Consider the force field F(x, y, z) = xi+ yj + zk. Compute the work done in moving a 
particle along the parabola y = x?, z = 0, from x = —1tox = 2. 


4. Let c be a smooth path. 


(a) Suppose F is perpendicular to c'(r) at the point c(t). Show that 


[ro 


(b) If F is parallel to c'(r) at c(t), show that 


fe ds= f MP1 ds. 
c c 


[By parallel to c'(/) we mean that F(c(r)) = A(r)e'(t), where A(t) > 0.] 
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5. Suppose the path c has length /, and ||F|| « M. Prove that 


[ras 
e 


6. Evaluate f. F- ds where F(x, y, z) = yi + 2xj + yk and the path c is defined by 
ce) 2 ti c 2j PK, 0x (x 1. 


< MI. 


7. Evaluate 


[reser —x)dy t zdz 


for each of the paths c(t) = (t, (^,0), 0 € t < 1, where n = 1,2, 3, .... 


8. This exercise refers to Example 12. Let L be a very long wire, a planar section of 
which (with the plane perpendicular to the wire) is shown in Figure 7.2.14. Suppose this 
plane is the xy plane. Experiments show that H is tangent to every circle in the xy plane 
whose center is the axis of L, and that the magnitude of H is constant on every such circle 
C. Thus, H — HT, where T is a unit tangent vector to C and H is some scalar. Using this 
information, show that H = //2zcr, where r is the radius of circle C and / is the current 
flowing in the wire. 


Figure 7.2.14 A planar section of a long wire and a 
curve C about the wire. 


9. The image of the path t + (cos? t, sin? 0), 0 € t < 2x in the plane is shown in Figure 
7.2.15. Evaluate the integral of the vector field F(x, y) — xi 4- yj around this curve. 


10. Suppose c, and c; are two paths with the same endpoints and F is a vector field. Show 
that Ja F-ds= fo F - ds is equivalent to f. F* ds = 0, where C is the closed curve obtained 
by first moving along c, and then moving along c; in the opposite direction. 


11. Let e(¢) be a path and T the unit tangent vector. What is f. T - ds? 


12. Let F = (z? + 2xy)i + x?j + 3xz?k. Show that the integral of F around the 
circumference of the unit square with vertices (+1, +1) is zero. 
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0 n=e(3) 


Figure 7.2.15 The hypocycloid 
Ei c(t) = (cos? t, sin? t) (Exercise 9). 
(1, 0) = c(0) = e(27) 


(-1,0)=e(7) 


0-0- ($) 


13. Using the path in Exercise 9, observe that a C! map c: [a, b] > IR? can have an image 
that does not “look smooth." Do you think this could happen if e'(?) were always nonzero? 


14. What is the value of the integral of a gradient field around a closed curve C? 
15. Evaluate the line integral 
[ 2xyz dx + xzdy + x!y dz, 
where C is an oriented simple curve connecting (1, 1, 1) to (1, 2, 4). 
16. Suppose Vf(x, y, z) = 2xyze i + ze" j + ye" k. If f(0, 0, 0) = 5, find (1, 1,2). 


17. Consider the gravitational force field (with G = m = M = 1) defined [for (x, y, z) # 
(0, 0, 0)] by 


1 OT 
F(x,y,z)— DL (xi 4- yj + zk). 


NES +2 
Show that the work done by the gravitational force as a particle moves from (xi, 1, zi) 
to (x2, Y», z2) along any path depends only on the radii Rj = yx? + y? + zi and 


R = JA +y +z. 


18. A cyclist rides up a mountain along the path shown in Figure 7.2.16. She makes one 
complete revolution around the mountain in reaching the top, while her vertical rate of climb 
is constant. Throughout the trip she exerts a force described by the vector field 

F(x, y, z) = yi - xj +k. 


What is the work done by the cyclist in traveling from A to B? What is unrealistic about this 
model of a cyclist? 
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Figure 7.2.16 How much work is done in 
cycling up this mountain? 


19. Let c: [a, b] > IR? be a path such that c'(r) 4 0. Recall from Section 4.1 that when 
this condition holds, c is said to be regular. Let the function f be defined by the formula 


f(x) = [f Well at. 


(a) What is df/dx? 

(b) Using the answer to part (a), prove that f: [a, b] — [0, L], where L is the length of 
c, has a differentiable inverse g: [0, L] — [a, b] satisfying f o g(s) = s, go f(x) =x. (You 
may use the one-variable inverse function theorem stated at the beginning of Section 3.5.) 

(c) Compute dg/ds. 

(d) Recall that a path s ++ b(s) is said to be of unit speed, or parametrized by arc length, 
if ||b’(s)|| = 1. Show that the reparametrization of c given by b(s) = c o g(s) is of unit speed. 
Conclude that any regular path can be reparametrized by arc length. (Thus, for example, the 
Frenet formulas in Exercise 17 of Section 4.2 can be applied to the reparametrization b.) 


20. Along a “thermodynamic path" C in (V, T, P) space, 


(i) The heat gained is Je Ay dV + Ky dT, where Ay, Ky are functions of (V, T, P), 
depending on the particular physical system. 
(ii) The work done is fẹ P dV. 


For a van der Waals gas, we have 


k; T = A JAy — E and Ky — constant, 


PY. T= Wap 


where R, b, a, and J are known constants. Initially the gas is at a temperature 7) and volume 
Vo. 


(a) An adiabatic process is a thermodynamic motion (V (t), T (t), P(t)) for which 


dT  dT/dt — My 
dV  dV/dt Ky’ 


If the van der Waals gas undergoes an adiabatic process in which the volume doubles to 2Vo, 
compute 


(1) the heat gained; 
(2) the work done; and 
(3) the final volume, temperature, and pressure. 
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(b) After the process indicated in part (a), the gas is cooled (or heated) at constant 
volume until the original temperature Tọ is achieved. Compute 


(1) the heat gained; 


(2) the work done; and 
(3) the final volume, temperature, and pressure. 


(c) After the process indicated in part (b), the gas is compressed until the gas returns to 
its original volume Vo. The temperature is held constant throughout the process. Compute 


(1) the heat gained; 
(2) the work done; and 
(3) the final volume, temperature, and pressure. 


(d) For the cyclic process described in parts (a), (b), (c), compute 


(1) the total heat gained; and 
(2) the total work done. 


7.3 Parametrized Surfaces 


In Sections 7.1 and 7.2, we studied integrals of scalar and vector functions along 
curves. Now we turn to integrals over surfaces and begin by studying the geometry 
of surfaces themselves. 


Graphs Are Too Restrictive 


We are already used to one kind of surface, namely, the graph of a function f(x, y). 
Graphs were extensively studied in Chapter 2, and we know how to compute their 
tangent planes. However, it would be unduly limiting to restrict ourselves to this case. 
For example, many surfaces arise as level surfaces of functions. Suppose our surface 
S is the set of points (x, y, z) where x — z +z? = 0. Here S is a sheet that (relative 
to the xy plane) doubles back on itself (see Figure 7.3.1). Obviously, we want to 


(xo. yo. 0) 
Figure 7.3.1 A surface that is not the 
graph of a function z = f(x, y). 
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call S a surface, because it is just a plane with a wrinkle. However, S is not the graph 
of some function z = f(x, y), because this means that for each (xo, yo) € R? there 
must be one zo with (xo, yo, zo) € S. As Figure 7.3.1 illustrates, this condition is 
violated. 

Another example is the torus, or surface of a doughnut, which is depicted in 
Figure 7.3.2. Anyone would call a torus a surface; yet, by the same reasoning as 
before, a torus cannot be the graph of a differentiable function of two variables. These 
observations encourage us to extend our definition of a surface. 


Figure 7.3.2 The torus is not the graph of a 


" — function of the form z = f(x, y). 


The motivation for the extended definition that follows is partly that a surface can 
be thought of as being obtained from the plane by “rolling,” “bending,” and “pushing.” 
For example, to get a torus, we take a portion of the plane and roll it (see Figure 7.3.3), 
then take the two “ends” and bring them together until they meet (Figure 7.3.4). 


ME 


Figure 7.3.3 The first step in obtaining a torus from a rectangle is to make a 
cylinder. 


— — — d — C 
Ends glued 


Figure 7.3.4 Bend the cylinder and glue the ends to get a torus. 


7.3 Parametrized Surfaces 453 


Parametrized Surfaces as Mappings 


Inour study of differential calculus we dealt with mappings f: 4 C IR" — IR", Taking 
n — 2 and m — 3 corresponds to the case of a two-dimensional surface in 3-space. 
With surfaces, just as with curves, we want to distinguish a map (a parametrization) 
from its image (a geometric object). This leads us to the following definition. 


DEFINITION: Parametrized Surfaces A parametrization of a surface is 
a function $: D C R? — R?, where D is some domain in R*. The surface S 
corresponding to the function 4 is its image: S = (D). We can write 


(u, v) = (x(u, v), y(u, v), z(u, v)). 
If ® is differentiable or is of class C! [which is the same as saying that 


x(u, v), y(u, v), and z(u, v) are differentiable or C! functions of (u, v)], we call 
S a differentiable or a C! surface. 


We can think of ® as twisting or bending the region D in the plane to yield the 
surface S (see Figure 7.3.5). Thus, each point (u, v) in D becomes a label for a point 
(x(u, v), y(u, v), z(u, v)) on S. 


x 


Figure 7.3.5 ® “twists” and “bends” D onto the surface $ = (D). 


Of course, surfaces need not bend or twist at all. In fact, planes are flat, as shown in 
our first, and simplest, example. 


Sy 8S] In Section 1.3 we studied the equation of a plane P. We did so in 
terms of graphs and level sets. Now we examine the same notion using a parametri- 
zation. 
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Let P be a plane that is parallel to two vectors œ and 8 and that passes through 
the tip of another vector y, as in Figure 7.3.6. 


Figure 7.3.6 Describing a plane 
parametrically. 


Our goal in this example is to find a parametrization of this plane. Notice that 
the vector œ x  =N, which we also write as Ai + Bj + Ck, is normal to P. If the 
tip of ^y is the point (xo, yo, zo), then the equation of P as a level set (as discussed in 
Section 1.3) is given by: 


A(x — xo) + BCy — yo) + C(z — zo) = 0. 


However, the set of all points on the plane P can also be described by the set of all 
vectors that are ^y plus a linear combination of œ and 8. Using our preferred choice 
of real parameters u and v, we arrive at the parametric equation of the plane P: 


(u, v) 2 aud Bv y. à 


Tangent Vectors to Parametrized Surfaces 


Suppose that ® is a parametrized surface that is differentiable at (uo, vo) € R?. Fixing 
u at uo, we get a map IR — R? given by t +> ®(uo, t), whose image is a curve on the 
surface (Figure 7.3.7). From Chapters 2 and 4 we know that the vector tangent to this 
curve at the point (uo, vo), which we denote by T,, is given by 

a® 


ax ., Oy -r OF 
Ty = áp gy (to Wi + 3, Uo vo)j + gy (o: Vok. 


Similarly, if we fix v and consider the curve t > ®(t, vo), we obtain the tangent 
vector to this curve at (uo, vo), given by 
09 


Ox dy üz 
- - i i k. 
T, ju au io? vo)i + a, Uo vo)j + a, Uo vo) 


Un 
un 
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A (o, vo) 


v= constant 


Figure 7.3.7 The tangent vectors T, and T, that are tangent to the curve on a 
surface S, and hence tangent to S. 


Regular Surfaces 


Because the vectors T, and T, are tangent to two curves on the surface at a given 
point, the vector T, x T, ought to be normal to the surface at the same point. 

We say that the surface S is regular or smooth’ at ®(uo, vo), provided that 
T, x T, Æ 0 at (uo, vo). The surface is called regular if it is regular at all points 
(uo, vo) € S. The nonzero vector T, x T, is normal to S (recall that the vector 
product of T, and T, is perpendicular to the plane spanned by T, and T,); the fact 
that it is nonzero ensures that there will be a tangent plane. Intuitively, a smooth 
surface has no “corners.”® 


S'S") Consider the surface given by the equations 


X —ucosv, y —usinv, gu, uz. 


Is this surface differentiable? Is it regular? 


SOLUTION These equations describe the surface z = yx? + y? (square the 
equations for x, y, and z to check this), which is shown in Figure 7.3.8. This sur- 
face is a cone with a “point” at (0, 0, 0); it is a differentiable surface because each 
component function is differentiable as a function of u and v. However, the surface 


"Strictly speaking, regularity depends on the parametrization ® and not just on its image S. Therefore, this terminology 
is somewhat imprecise; however, it is descriptive and should not cause confusion. (See Exercise 15.) 


SIn Section 3.5, we showed that level surfaces f(x, y, z) = 0 were in fact graphs of functions of two variables in some 
neighborhood of a point (xo, yo, zo) satisfying V f(xo, yo, zo) # 0. This united two concepts of a surface—graphs and 
level sets. Again, using the implicit function theorem, it is likewise possible to show that the image of a parametrized 
surface 4 in the neighborhood of a point (uo, vo) where T, x T, # 0 is also the graph of a function of two variables. 
Thus, all definitions of a surface are consistent. (See Exercise 16.) 
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Figure 7.3.8 The surface z = yx? + y? is a cone. It is not 
regular at its tip. 


P f 
is not regular at (0, 0, 0). To see this, compute T,, and T, at (0, 0) € R?: 


ð ð a 9. 
Ty * (0, 0)i + ? (0, 0)j + Z (0, 0)k = (cos 0)i + (sin 0)j + k = i +k, 
du ðu ðu ðu 


and similarly, 


0d 
T, = 5, = ((-sinO)i + ((cos0)j + 0k = 0. 


Thus, T, x T, = 0, and so, by definition, the surface is not regular at (0,0,0). 4 


Tangent Plane to a Parametrized Surface 


We can use the fact that n = T, x T, is normal to the surface to both formally define 
the tangent plane and to compute it. 


DEFINITION: The Tangent Plane to a Surface If a parametrized surface 
®: D c R? > R is regular at (uo, vo), that is, if T, x T, Æ Oat (uo, vo), we 
define the tangent plane of the surface at (uo, vo) to be the plane determined 
by the vectors T, and T,. Thus, n = T, x T, isa normal vector, and an equation 
of the tangent plane at (xo, yo, zo) on the surface is given by 


(x — Xo, Y — yo, Z — zo): n = 0, (1) 


where n is evaluated at (uo, vo); that is, the tangent plane is the set of (x, y, z) 
satisfying (1). If m = (mi, n5, n3) = nii + naj + n3k, then formula (1) becomes 


ny(x — xo) + nj(y — yo) + ns(2 — zo) = 0. a) 
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DENE] Leto: R? — R? be given by 


X —ucosv, y —usinv, za? V. 


Where does a tangent plane exist? Find the tangent plane at (1, 0). 
SOLUTION We compute 


T, = (cos v)i + (sin v)j + 2uk and T, = —u(sin v)i + u(cos v)j + 2uk, 


so the tangent plane at the point (uo, vo) is the set of vectors through (uo, vo) 
perpendicular to 


(T, x Ty)(uo, vo) = (—21$ cos vo + 2o sin vo, —2u2 sin vo — 2vo cos vo, uo) 
if this vector is nonzero. Because T, x T, is equal to 0 at (uo, vo) = (0, 0), we cannot 
find a tangent plane at ®(0, 0) = (0, 0, 0). However, we can find an equation of the tan- 
gent plane at all the other points, where T, x T, Z 0. At the point ®(1, 0)=(1, 0, 1), 
n = (T, x T,)(1, 0) = (—2, 0, 1) = —2i + k. 


Because we have the vector n normal to the surface and a point (1, 0, 1) on the surface, 
we can use formula (1^) to obtain an equation of the tangent plane: 


—2(x —1)+(¢—1) = 0; that is, z =2x — 1. A 


[EXAMPLE 4] Suppose a surface S is the graph of a differentiable function 


g: R^ > R. Write S in parametric form and show that the surface is smooth at 
all points (uo, vo, g(uo, vo)) € R?. 


SOLUTION Write S in parametric form as follows: 
x=u, yor, z = gu, v), 
which is the same as z — g(x, y). Then at the point (uo, vo), 
T, =i+ M v)k and T,=j+ E (uo. vo)k, 
and for (uo, vo) € R2, 


9, ð, 
n = T, x T, = — (uo, vo)i — E (uo, vo)j + k # 0. (2) 
ðu ðv 
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This is nonzero because the coefficient of k is 1; consequently, the parametrization 
(u, v) > (u, v, g(u, v)) is regular at all points. Moreover, the tangent plane at the point 
(Xo, Yo, Zo) = (uo, vo, g(to, vo)) is given, by formula (1), as 


(= 0,9 mica C,H, J =0, 


where the partial derivatives are evaluated at (uo, vo). Remembering that x = u and 
y — v, we can write this as 


— (2e x» (3o yo), 6) 
bs ay 


where dg/dx and dg/dy are evaluated at (xo, yo). á 


This example also shows that the definition of the tangent plane for parametrized 
surfaces agrees with the one for surfaces obtained as graphs, because equation (3) is 
the same formula we derived (in Chapter 2) for the plane tangent to S at the point 
(Xo, Yo, zo) € S. 

It is also useful to consider piecewise smooth surfaces, that is, surfaces com- 
posed of a certain number of images of smooth parametrized surfaces. For exam- 
ple, the surface of a cube in IR? is such a surface. These surfaces are considered in 
Section 7.4. 


CWOPBES] Find a parametrization for the hyperboloid of one sheet: 
x? + y? -z2 zl. 


SOLUTION Because x and y appear in the combination x? + y’, the surface is 
invariant under rotation about the z axis, and so it is natural to write 


x =rcosé, y —rsin8. 


Then x? + y? — z? = | becomes r? — z? = 1. This we can conveniently parametrize 
by? 


r — cosh u, z — sinh u. 
Thus, a parametrization is 
x — (cosh u)(cos 0), y = (cosh u)(sin 0), z — sinh u, 
where 0 < 0 < 2m, —œ0 <u < œ. A 


?Recall from one-variable calculus that cosh u = (e" + e™")/2 and sinhu = (e" — e™")/2. One easily verifies from these 
definitions that cosh? u — sinh? u = 1. 
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EXERCISES 


In Exercises 1 to 3, find an equation for the plane tangent to the given surface at the specified 
point. 


1. x =2u, y=u +v, z=v’, at (0, 1, 1) 


ERU ROT: = mage 1d 
Rxesw—-v, ysutv, z=u?+4v,  at(-l 42) 
X xcu, y — usine", z= lu cose”, at (13, —2, 1) 
4. At what points are the surfaces in Exercises | and 2 regular? 


5. Find an expression for a unit vector normal to the surface 
X = cos v sinu, y — sinvsinu, Z=cosu 


at the image of a point (u, v) for u in [0, x] and v in [0, 27]. Identify this surface. 


6. Repeat Exercise 5 for the surface 
x =3cos@sing, y =2sin@ sing, z —cosó 


for 0 in [0, 27] and ¢ in [0, 7]. 


7. Repeat Exercise 5 for the surface 
x — sinv, y-u, Z=cosv 


for 0 x v < 2m and —1 <u <3. 


8. Repeat Exercise 5 for the surface 
x = (2 — cosv)cosu, y = (2 — cos v)sinu, z —Sinv 


for —x <u <n, —r < v x m. Is this surface regular? 


9. (a) Develop a formula for the plane tangent to the surface x = A( y, z). 
(b) Obtain a similar formula for y = k(x, z). 


10. Find the equation of the plane tangent to the surface x = u?, y = v^, z = u? + v? at the 
point u = l,v = 1. 


11. Find a parametrization of the surface z = 3x? + 8xy and use it to find the tangent plane 
atx = 1, y = 0, z = 3. Compare your answer with that using graphs. 


12. Find a parametrization of the surface x? +3xy +z? —2, z > 0, and use it to find the 
tangent plane at the point x = 1, y = 1/3, z = 0. Compare your answer with that using level 
sets. 


13. Consider the surface in IR? parametrized by 


®(r, 0) = (r cos, r sin6, 0), 0<rz<l and 0x0 <4r. 
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(a) Sketch and describe the surface. 

(b) Find an expression for a unit normal to the surface. 

(c) Find an equation for the plane tangent to the surface at the point (xo, yo, zo). 

(d) If (xo, yo, zo) is a point on the surface, show that the horizontal line segment of unit 
length from the z axis through (xo, Yo, zo) is contained in the surface and in the plane tangent 
to the surface at (xo, yo, zo). 


14. Given a sphere of radius 2 centered at the origin, find the equation for the plane that is 
tangent to it at the point (1, 1, 2) by considering the sphere as: 


(a) a surface parametrized by ®(6, 9) = (2cos0 sing, 2 sin 8 sing, 2 cos); 
(b) a level surface of f(x, y, z) = x? + y? +27; and 
(c) the graph of g(x, y) = /4 — x? — y?. 


15. (a) Find a parametrization for the hyperboloid x? + y? — z? = 25. 

(b) Find an expression for a unit normal to this surface. 

(c) Find an equation for the plane tangent to the surface at (xo, yo, 0), where 
XQ ye —25. 

(d) Show that the lines (xo, yo, 0) + t(—yo, xo, 5) and (xo, yo, 0) + t( yo, —xo, 5) lie in 
the surface and in the tangent plane found in part (c). 


16. A parametrized surface is described by a differentiable function ®: IR? > R°. 
According to Chapter 2, the derivative should give a linear approximation that yields a 
representation of the tangent plane. This exercise demonstrates that this is indeed the case. 


(a) Assuming T, x T, Æ 0, show that the range of the linear transformation D®(uo, vo) 
is the plane spanned by T, and T,. [Here T, and T, are evaluated at (uo, vo).] 

(b) Show that wL (T, x T,) if and only if w is in the range of D®(uo, vo). 

(c) Show that the tangent plane as defined in this section is the same as the 
“parametrized plane" 


(u, v) e (uo, vo) + D&(uo, wf B 4 


17. Consider the surfaces «b, (u, v) = (u, v, 0) and &;(u, v) = (i? v?, 0). 


(a) Show that the image of ®, and of 4; is the xy plane. 

(b) Show that ®, describes a regular surface, yet ®2 does not. Conclude that the notion 
of regularity of a surface S depends on the existence of at least one regular parametrization 
for S. 

(c) Prove that the tangent plane of S is well defined independently of the regular 
(one-to-one) parametrization (you will need to use the inverse function theorem from 
Section 3.5). 

(d) After these remarks, do you think you can find a regular parametrization of the cone 
of Figure 7.3.7? 


18. Let be a regular surface at (uo, vo); that is, ® is of class C! and T, x T, £0 at (uo, vo). 


(a) Use the implicit function theorem (Section 3.5) to show that the image of near 
(uo, vo) is the graph of a C' function of two variables. If the z component of T, x T, is 
nonzero, we can write itasz — f(x, y). 

(b) Show that the tangent plane at (uo, vo) defined by the plane spanned by T, and T, 
coincides with the tangent plane of the graph of z = f(x, y) at this point. 
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7.4 Area of a Surface 


Before proceeding to general surface integrals, let us first consider the problem of 
computing the area of a surface, just as we considered the problem of finding the arc 
length of a curve before discussing path integrals. 


Definition of Surface Area 


In Section 7.3, we defined a parametrized surface S to be the image of a function 
$: D C R? — R’, written as B(u, v) = (x(u, v), y(u, v), z(u, v)). The map was 
called the parametrization of S and S was said to be regular at (u, v) € S provided 
that T, x T, 4 0, where 


Ox oy az 
T, i »v)j " 
7 au v)i 4- 2; v)j + Ju" v)k 


and 


a a a 
T, = Z (u, i+ 2, v)j + Ze, vk. 
ðv ðv ðv 


Recall that a regular surface (loosely speaking) is one that has no corners or breaks. 

In the rest of this chapter and in the next one, we shall consider only piecewise 
regular surfaces that are unions of images of parametrized surfaces ®;: D; — IR? for 
which: 


(i) D; is an elementary region in the plane; 
(ii) ®; is of class C! and one-to-one, except possibly on the boundary of D;; and 


(iii) S;, the image of 4; is regular, except possibly at a finite number of points. 


DEFINITION: Area of a Parametrized Surface We define the surface 
area? A(S) of a parametrized surface by 


a= [f IT, x T, dudv a) 


where ||T,, x T, || is the norm of T, x T,. If S is a union of surfaces S;, its area 


is the sum of the areas of the S;. 
TAERAA. 


10 As we have not yet discussed the independence of parametrization, it may seem that A(S ) depends on the parametrization 
®. We shall discuss independence of parametrization in Section 7.6; the use of this notation here should not cause 
confusion. 
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As the reader can easily verify, we have 
_ Ment , [3052 , [96,2] 
addc LES 5] * [eo] "el " 


ax ax 
ay) |9u av 
O(u, v) dy day 
ðu ðv 


where 


and so on. Thus, formula (1) becomes 


ac, 2 [452 ? [96.2] 
ay [hl ES 2] En 2] + 2] audu. 6) 


Justification of the Area Formula 


We can justify the definition of surface area by analyzing the integral [fp ||T,, x 
T, || du dv in terms of Riemann sums. For simplicity, suppose D is a rectangle; con- 
sider the nth regular partition of D, and let R;; be the ith rectangle in the partition, 
with vertices (uj, vj), (Ui+1, Vj), (ui, vj41), and (uii, vj) 0 i sn—-1,0< 
j € n — 1. Denote the values of T, and T, at (u;, vj) by Tą, and T,,. We can think of 
the vectors AuT,, and AvT,, as tangent to the surface at ®(u;, vj) = (xij, Vij, Zij) 
where Au = uj41 — uj, Av = vj44 — vj. Then these vectors form a parallelogram 
Pjj that lies in the plane tangent to the surface at (x;;, yjj, zij) (see Figure 7.4.1). 


(x Yi zy) 


Figure 7.4.1. ||T,, x Ty, Au Av is equal to the area of a parallelogram that 
approximates the area of a patch on a surface S$ = (D). 
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We thus have a “patchwork cover" of the surface by the P;;. For n large, the area 
of P;; is a good approximation to the area of @(R;;). Because the area of the par- 
allelogram spanned by two vectors v; and vz is ||vi x v2|| (see Chapter 1), we see 
that 


ACP) = |AuTy, x AvT, || = Tu, x Ty ]Au Av. 


Therefore, the area of the patchwork cover is 


n-l n-1l n—l n-l 
A, = 9 2 AP) = YI DWT a, x Ty] Au Av. 
i=0 j=0 i=0 j=0 


As n — co, the sums A, converge to the integral 


I IT, x T.I du dv. 
D 


Because 4, should approximate the surface area better and better as n — oo, we are 
led to formula (1) as a reasonable definition of A(S). 


DAA Let D be the region determined by 0 < 0 x 27,0 <r < Land let 
the function ®: D — R’, defined by 


x =rcosé, y —rsin6, zc 


be a parametrization of a cone S (see Figure 7.3.8). Find its surface area. 
SOLUTION In formula (3), 


ay) — le =r zA EN 

8(r,0)  lsinO rcosü| ^" 

a(y,z) — s rcosÓ| | o 

8(,0) | 1 Om [I ? 
and 

a(x,z)  |cos6 —r sind =rane 

ar,@) | 1 OS : 


so the area integrand is 


IT, x Toll = Vr? +72 cos? 0 +r? sin? 0 = r2. 
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Clearly, || T, x To|| vanishes for r = 0, but (0, 0) = (0, 0, 0) for any 0. Thus, 
(0, 0, 0) is the only point where the surface is not regular. We have 


2x 1 2m 
I IT, x Toldrao = [ f Vir drdo = | 5240 = Vn. 
D 0 0 0 


To confirm that this is the area of (D), we must verify that ® is one- 
to-one (for points not on the boundary of D). Let D be the set of (r, 0) with 
0 <r <1 and 0 <@ <2z. Hence, D? is D without its boundary. To see that 
®: D? — R? is one-to-one, assume that ®(r, 0) = B(r’, 0^) for (r, 0) and (r’, 6’) € 
D. Then 


r cosü =r" cos", r sin — r' sing’, Fer. 


From these equations it follows that cos@ = cos 0’ and sin@ = sin 6'. Thus, either 
0 = 0' or 0 = 0' + 21 n. But the second case is impossible for n a nonzero integer, 
because both 0 and 6’ belong to the open interval (0, 27r), and thus cannot be more than 
2x radians apart. This proves that off the boundary, ® is one-to-one. (Is : D — R? 
one-to-one?) In future examples, we shall not usually verify that the parametrization 
is one-to-one when it is intuitively clear. A 


UO WSIUESM A helicoid is defined by ®: D > R?, where 


x —rcos6, y —rsin6, go 


and D is the region where 0 < 0 < 2z and 0 <r < 1 (Figure 7.4.2). Find its area. 


Figure 7.4.2 The helicoid 
x =r cos0, y =r sinb, z — 0. 
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SOLUTION We compute d(x, y)/8(r, 0) = r as in Example 1, and 


a(y, 2 

(y. z) = [in rest —! 
ar, 0) 

a(x, Zz) _ po —rsinó| _ cg 
8(n80) | 0 l. = ` 


The area integrand is therefore 4/r? + 1, which never vanishes, so the surface is 
regular. The area of the helicoid is 


P 1 
f IIT, x Tell dr de = { f Vr? + 1drd0 = an f vr? + 1dr. 
D o Jo 0 


After a little computation (using the table of integrals), we find that this integral is 
equal to 


z[V2 -log(14- V2). A4 


Surface Area of a Graph 


A surface S given in the form z = g(x, y), where (x, y) € D, admits the parametri- 
zation 


xcu, y=v, z=g(u,v) 


for (u, v) € D. When g is of class C!, this parametrization is smooth, and the formula 
for surface area reduces to 


fL Res 9 


after applying the formulas 


9, 9, 
T,=i+ Sk T,=j+k, 
ðu ðv 
and 
ag, Og. 8g. 9g, 
T, T,= k=- k, 
diis. us wit ax aye 


as noted in Example 4 of Section 7.3. 


466 Integrals Over Paths and Surfaces 


Surfaces of Revolution 


In most books on one-variable calculus, it is shown that the lateral surface area 
generated by revolving the graph of a function y = f(x) about the x axis is given by 


b 
A=2n | ULOVI 4 LP GP) dx. (5) 


If the graph is revolved about the y axis, the surface area is 


b 
A=2n Í (Ix / T UP GT) ax. (6) 


We shall derive formula (5) by using the methods just developed; one can obtain 
formula (6) in a similar fashion (Exercise 10). 

To derive formula (5) from formula (3), we must give a parametrization of S. 
Define the parametrization by 


Uu y = f(u)cosv, z — f(u)sinv 
over the region D given by 
a<u<b, Oxvcz2m. 
This is indeed a parametrization of S, because for fixed u, the point 
(u, f(u) cos v, f(u) sin v) 


traces out a circle of radius | f(u)| with the center (u, 0, 0) (Figure 7.4.3). 


Figure 7.4.3 The curve y = f(x) rotated 
about the x axis. 


Circumference = 27| f (x)| 


We calculate 


a(x, z) 
O(u, v) 


ay) _ 
afu, v) -— 


9(yz) _ 
a(u,v) -— 


SWS W, = f(u)cosv, 


— f (u)sin v, 
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and so by formula (3) 


ax, ny [2 * fazat 
i Ih E | + [Fea + Teal ines 


=f | JIFDI sin? v + LFGOPLP'GOP + LFGOP cos? v du dv 
D 


= If IF@IV1 + L/P du dv 
b p2x 
zi [ VF) 1 +L (WP dv du 


b 
=2n [ OESTE 


which is formula (5). 
If S is the surface of revolution, then 27 | f (x)| is the circumference of the vertical 
cross section to S at the point x (Figure 7.4.3). Observe that we can write 


b 
m f IOVI + [f'E dx = [2x760145. 


where the expression on the right is the path integral of 27r | f (x)| along the path given 
byc: [a, b] > R?, t e (t, f(t). Therefore, the lateral surface ofa solid of revolution 
is obtained by integrating the cross-sectional circumference along the path that is the 
graph of the given function. 


The most famous mathematician in ancient times was Archimedes. In 
addition to being an extraordinarily gifted mathematician, he was also an 
engineering genius on a scale never before seen and was greatly admired by 
his contemporaries and by later writers for his insights into mechanics. It was 
these talents that helped the people of the city of Syracuse in 214 B.c. to 
defend their city against the onslaught of the Roman legions under their 
commander Marcellus. 

When the Romans besieged the city, they encountered an enemy whom 
Archimedes had supplied—totally unexpectedly—with powerful weapons, 
including artillery and burning mirrors, which, as legend has it, incinerated 
the Roman fleet. 

. Thesiege of Syracuse lasted two years, and the city finally fell as a result 
of acts of treason. In the aftermath of the assault, the old scientist was slain 
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by a Roman soldier, even though the commander had asked his men to spare 
Archimedes' life. As the story goes, Archimedes was sitting in front of his 
house studying some geometric figures he had drawn in the sand. When a 
Roman soldier approached, Archimedes shouted out, “Don’t disturb my 
figures!” The ruffian, feeling insulted, slew Archimedes. 

To honor this great man, Marcellus erected a tomb for Archimedes on 
which, according to Archimedes'sown wishes, were depicted a cone, a 
sphere, and a cylinder (Figure 7.4.4). 


— X1 X. 


Figure 7.4.4 Archimedes’ theorem: The ratios of the volumes of a cone, a 
half ball, and a cylinder, all of the same height and radius, are 1:2:3. 


Archimedes was incredibly proud of his calculation of the volume and 
surface area of the sphere, which justifiably were seen as truly outstanding 
accomplishments for their time. As in his works on centers of gravity, for 
which he provided no clear definition, Archimedes was able to compute the 
surface area of the sphere without having a clear definition of precisely what 
it was. However, as with many mathematical works, one knows the answer 
long before a proof or even the correct definition can be found. 

The problem of properly defining surface areas is a difficult one. To 
Archimedes’ credit, a careful theory of surface areas was not achieved until 
the twentieth century, after a long development that began in the seventeenth 
century with the discovery of calculus. 

Christiaan Huygens (1629-1695) was the first person since Archimedes 
to give results on the areas of special surfaces beyond the sphere, and he 
obtained the areas of portions of surfaces of revolution, such as the 
paraboloid and hyperboloid. 

The brilliant and prolific mathematician Leonhard Euler (1707-1783) 
presented the first fundamental work on the theory of surfaces in 1760 
with Recherches sur la courbure des surfaces. However, it was in 1728, ina 
paper on shortest paths on surfaces, that Euler defined a surface as a graph 
z= f(x,y). Euler was interested in studying the curvature of surfaces, and in 
1771 he introduced the notion of the parametric surfaces that are described in 
this section. 

After the rapid development of calculus in the early eighteenth century, 
formulas for the lengths of curves and areas of surfaces were developed. 
Although wedo not know when all the area formulas presented in this section 
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first appeared, they were certainly common by the end of the eighteenth 
century. The underlying concepts of the length of a curve and the area of a 
surface were understood intuitively before this time, and the use of formulas 
from calculus to compute areas was considered a great achievement. 

Augustin-Louis Cauchy (1789-1857) was the first to take the step of 
defining the quantities of length and surface areas by integrals as we have 
presented in this book. The question of defining surface area independent of 
integrals was taken up somewhat later, but this posed many difficult 
problems that were not properly resolved until this century. 

We end this section by describing the fascinating classic area problem 
of Plateau, which has enjoyed a long history in mathematics. The Belgian 
physicist Joseph Plateau (1801—1883) carried out many experiments from 
1830 to 1869 on surface tension and capillary phenomena, experiments that 
had enormous impact at the time and were repeated by notable nineteenth- 
century physicists, such as Michael Faraday (1791-1867). The 
corresponding collection of mathematical problems relating to soap 
films was named in 1904 after Plateau by the great French mathematician 
Henri Lebesgue (1875-1941). 

If a wire is dipped into a soap or glycerine solution, then one usually 
withdraws a soap film spanning the wire. Some examples are given in 
Figure 7.4.5, although readers might like to perform the experiment for 
themselves. Plateau raised the mathematical question: For a given boundary 
(wire), how does one prove the existence of such a surface (soap film) and 
how many surfaces can there be? The underlying physical principle is that 
nature tends to minimize area; that is, the surface that forms should bea 
surface of least area among all possible surfaces that have the given curve as 
their boundary. This again is another example of the action principle of 
Maupertuis and Leibniz (c.f. Section 3.3). 


Figure 7.4.5 Two soap films spanning wires. 
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For soap film surfaces that are disklike, the problem can be formulated 
in the following way. Let D C IR? be the unit disk defined to be the set 
f(x, y) | x? + y? < 1} and let 8D be its boundary. Furthermore, suppose that 
the image I of c: [0, 277] — IR? isa simple closed curve, F representing a 
wire in R?. 

Let $ be the set of all maps ®: D — R? such that (AD) = T, © is of 
class C!, and ® is one-to-one on 8D. Each € 8 represents a parametric C! 
"disklike" surface "spanning" the wire I. 

The soap films in Figure 7.4.5 are not disklike, but represent a system of 
multiple disklike surfaces. Figure 7.4.6 shows a contour that bounds two 
disklike surfaces and one nondisklike surface. 


Figure 7.4.6 Soap film surfaces; (b) and (c) are disklike surfaces, but (a) 
is not. 


For each ® € 8, consider the area of the image surface, namely, A(®) = 
iy p Tu x T,|| du dv. This area is a function that assigns to each parametric 
surface its area. Plateau asked whether A has a minimum on §; that is, does 
there exist a Po such that A(®y) < A(®) for all & € $? Unfortunately, the 
methods of this book are not adequate to solve this problem. We can tackle 
questions of finding minima of real-valued functions of several variables, but 
in no way can the set $ be thought of as a region in IR" for any n! 

In his own study of surfaces of least area, Weierstrass showed that if a 
minimum 


Solu, v) = (x(u, v), y(u, v), z(u, v)) 


existed at all, it would have to satisfy (after suitable normalizations) the 
partial differential equations 


à) V^, = 0 


Ln 
du 


0^, 
dv 


(iii) | 
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where ||w|| denotes the "norm"or length of the vector w. This example 
illustrates the intimate connections between problems of maxima and 
minima (the calculus of variations) and the subject of partial differential 
equations. 

For well over 70 years, mathematicians such as Riemann, Weierstrass, 


H. A. Schwarz, Darboux, and Lebesgue puzzled over the challenge posed by 
Plateau. In 1931 the question was finally settled when Jesse Douglas showed 
that such a Bo existed. However, many questions about soap films remain 
unsolved, and this area of research is still active today." 


EXERCISES 


1. Find the surface area of the unit sphere S represented parametrically by ®: D > 
S c R?, where D is the rectangle 0 < 0 < 27,0 < $ < x and @ is given by the 
equations 


x = cosÓ sinó, y — sin sing, z — cosó. 


Note that we can represent the entire sphere parametrically, but we cannot represent it in the 
formz — f(x, y). 


2. In Exercise 1, what happens if we allow $ to vary from —z/2 to 2/2? From 0 to 277? 
Why do we obtain different answers? 


3. Find the area of the helicoid in Example 2 if the domain D is0 < r x 1 and0 < 0 < 37. 


4. The torus T can be represented parametrically by the function ®: D — R?, where is 
given by the coordinate functions x = (R + cos 9) cos @, y = (R + cosó)sin0,z = sing; D 
is the rectangle [0, 27r] x [0, 27], that is, 0 < 0 < 2z,0 < $ < 27; and R > 1 is fixed (see 
Figure 7.4.7). Show that A(T) = (27)’R, first by using formula (3) and then by using 
formula (6). 


5. Let (u, v) = (u — v, u + v, uv) and let D be the unit disk in the uv plane. Find the area 
of &(D). 


6. Find the area of the portion of the unit sphere that is cut out by the cone z > x? + y? 
(see Exercise 1). 


7. Show that the surface x = 1/,/y? + z2, where 1 < x < oo, can be filled but not painted! 


8. Find a parametrization of the surface x? — y? = 1, where x > 0, —1 < y < 1 and 
0 < z < 1. Use your answer to express the area of the surface as an integral. 


1 For more information on this fascinating subject, the reader may consult The Parsimonious Universe: Shape and Form 
in the Natural World, by S. Hildebrandt and A. Tromba, Springer-Verlag, New York/Heidelberg, 1995. 
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Figure 7.4.7 A cross section of a torus. 


9. Represent the ellipsoid E: 


2 2 2 
y 
a p E 


parametrically and write out the integral for its surface area A(E). (Do not evaluate the 
integral.) 


10. Let the curve y = f(x), a < x < b, be rotated about the y axis. Show that the area of the 
surface swept out is given by equation (6); that is, 


A =n f Ixi/ 1 + [POP dx. 


Interpret the formula geometrically using arc length and slant height. 


11. Find the area of the surface obtained by rotating the curve y = x?, 0 < x < 1, about the 
y axis. 


12. Use formula (4) to compute the surface area of the cone in Example 1. 
13. Find the area of the surface defined by x + y +z = 1, x? +2y° <1. 


14. Show that for the vectors T, and T,, we have the formula 


_ [[96.2T [away , [96,2] 
IT, x Toll EA + [2] e] i 


15. Compute the area of the surface given by 
x=rcosé, y=2rcosð, z=0, O<r<l, O<6<2z. 


Sketch. 


16. Prove Pappus’ theorem: Let c: [a, b] > R? be a C! path whose image lies in the right 
half plane and is a simple closed curve. The area of the lateral surface generated by rotating 
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the image of c about the y axis is equal to 2: X/(c), where x is the average value of the x 
coordinates of points on c and /(c) is the length of c. (See Exercises 8 to 11, Section 7.1, 
for a discussion of average values.) 


17. The cylinder x? + y? = x divides the unit sphere S into two regions S; and S2, where S, 
is inside the cylinder and 5? outside. Find the ratio of areas 4(55)/ A(S). 


18. Suppose a surface S that is the graph of a function z = f(x, y), where (x, y) e Dc R? 
can also be described as the set of (x, y, z) € R? with F(x, y, z) = 0 (a level surface). Derive 
a formula for A(S) that involves only F. 


19. Calculate the area of the frustum shown in Figure 7.4.8 using (a) geometry alone and, 
second, (b) a surface area formula. 


Figure 7.4.8 A line segment revolved around the y axis 
becomes a frustum of a cone. 


20. A cylindrical hole of radius 1 is bored through a solid ball of radius 2 to form a ring 
coupler, as shown in Figure 7.4.9. Find the volume and outer surface area of this coupler. 


Figure 7.4.9 Find the outer surface area and volume of 
Y? the shaded region. 
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21. Find the area of the graph of the function f(x, y) = 3(x°/? + 597?) that lies over the 
domain [0, 1] x [0, 1]. 


22. Express the surface area of the following graphs over the indicated region D as a double 
integral. Do not evaluate. 


(a) (x + 2y)?; D = [71,2] x [0, 2] 

(b) xy +x/(y +1); D =[1, 4] x [1, 2] 

(c) xyler'; D = unit circle centered at the origin 

(d) y cos x; D = triangle with vertices (—1, 1), (0, 2), and (1, 1) 


23. Show that the surface area of the upper hemisphere of radius R, z = y R? — x? — y?, 
can be computed by formula (4), evaluated as an improper integral. 


7.5 Integrals of Scalar Functions Over Surfaces 


Now we are ready to define the integral of a scalar function f over a surface S. 
This concept is a natural generalization of the area of a surface, which corresponds 
to the integral over S of the scalar function f(x, y, z) — 1. This is quite analogous to 
considering the path integral as a generalization of arc length. In the next section we 
shall deal with the integral of a vector function F over a surface. These concepts will 
play a crucial role in the vector analysis treated in the final chapter. 

Let us start with a surface 5 parametrized by a mapping ®: D — S C R?, where 
D is an elementary region, which we write as (u, v) = (x(u, v), y(u, v), z(u, v)). 


DEFINITION: The Integral of a Scalar Function Over a Surface If 
f(x, y. z) isareal-valued continuous function defined on a parametrized surface 
S, we define the integral of f over S to be 


[[ res - ff ras = ff rens o, ranas. w 


Written out, equation (1) becomes 


» ax »)Y , [80.2 , [962] 
f[ras- ff setosa o [822] + [5-2] + [423] dudv. (2) 
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Thus, if f is identically 1, we recover the area formula (3) of Section 7.4. Like 
surface area, the surface integral is independent ofthe particular parametrization used. 
This will be discussed in Section 7.6. 

We can gain some intuitive knowledge about this integral by considering it as 
a limit of sums. Let D be a rectangle partitioned into n? rectangles Rij with areas 
Au Av. Let Sj; = 9(R;;) be the portion of the surface ®(D) corresponding to Rj; 
(see Figure 7.5.1), and let A(S;;) be the area of this portion of the surface. For large 
n, f will be approximately constant on S;;, and we form the sum 


n-l n-l 
$,— 3 J MSU, yp AS), 6) 


i-0 j=0 


where (u;, vj) € Rij. From Section 7.4 we have a formula for A(S;;): 
As = [[ m ratus, 
Rij 


which, by the mean-value theorem for integrals, equals |[T,; x T; || Au Av for some 
point (už, vj) in Rj;. Hence, our sum becomes 


n—-ln-l 


$ — OYE fi v); x To Au Av, 


i=0 j=0 


which is an approximating sum for the last integral in formula (1). Therefore, 


limit S, = Í ji fas. 
S ; 


zZ 
o 
D a 
— 
á P "UN 
P 4 


Figure 7.5.1 € takes a portion Rj; of D to a portion of S. 
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Note that each term in the sum in formula (3) is the value of f at some point 
®(u;, vj) times the area of S;;. Compare this with the Riemann-sum interpretation of 
the path integral in Section 7.1. 

If S is a union of parametrized surfaces S;, i = 1,..., N, that do not intersect 
except possibly along curves defining their boundaries, then the integral of f over S 


is defined by 
N 
Il 55- Xf fds, 
s i= * 78i 


as we should expect. For example, the integral over the surface of a cube may be 
expressed as the sum of the integrals over the six sides. 


OWN] Suppose a helicoid is described as in Example 2, Section 7.4, and 
let f be given by f(x, y, z) = yx? + y? + 1. Find ff, f dS. 
SOLUTION As in Examples | and 2 of Section 7.4, 


30»). 9(y, z) din a(x, z) 
8(,0) ^" ae, O) 8(r,0) 


= cos 6. 
Also, f(r cos 6, r sin0, 0) = Vr? + 1. Therefore, 
[[ 765 ff rete, om, rano 
s D 


2x 1 21 4 8 
=f [ Visi Iard = [ déc sm. A 
o Jo 0 


Surface Integrals Over Graphs 


Suppose S is the graph of a C! function z = g(x, y). Recall from Section 7.4 that we 
can parametrize S by 


xau, yv, z — g(u, v), 


and that in this case 


9gV | fag V? 
IT, x T] = ie (35) +(#), 
ðu ðv 
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so 


a 2 2 
[[ teoraas= ff re e n (28) +E) aca. @ 


DAIJI Let S be the surface defined by z = x? + y, where D is the region 
0<x <1,-1<y <1. Evaluate ff. x dS. 


SOLUTION If we letz = g(x, y) = x? + y, formula (4) gives 


ag\? fagy? 1 fi 
[[=as= ffi» 1+ (3) +(#) dx dy = f [ orm 
S D -1/0 
1 1 1 2 1 t 
= if, [ (2 + 4x?) "(8x as) ay = A 5 [Ima 
1 ! 3/2 3/2 1 3/2 3/2 V2 
-af 6 — 2B?) dy = 6? -2 )=v6- = 
eual«a- sl. a 
3 


EXAMPLE 3 Evaluate [hz dS, where S is the unit sphere x? + y? +z? = 1. 


SOLUTION For this problem, it is convenient to use spherical coordinates 
and to represent the sphere parametrically by the equations x = cos@ sing, y = 
sin 0 sing, z = cos $, over the region D in the 0$ plane given by the inequalities 
0<¢<12,0 <6 -2.From equation (1) we get 


J|? aS [f eser. x Toll do dọ. 


A little computation [use formula (2) of Section 7.4; see Exercise 6] shows that 
IT, x Toll = sind. 


(Note that for 0 < @ < x, we have sing > 0). Thus, 


2n d 
[[2as=[ f cos? ¢ sing dọ d6 
S 0 Jo 
4n 


1 2n 3 2 2x 
a Hd = = 
=3/, [—cos"$] d0 Il do 7" A 
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This example also shows that on a sphere of radius R, 


[jis [p "n (o, OR’ sing dé do, 


or, for short, the area element on the sphere is given by 
dS = R sing d$ d0. 


Integrals Over Graphs 


We now develop another formula for surface integrals when the surface can be repre- 
sented as a graph. To do so, we let S be the graph of z = g(x, y) and consider formula 
(4). We claim that 


II fy gas = [| EELEE ay ay, (5) 
S D 


cos 0 


where 6 is the angle the normal to the surface makes with the unit vector k at the point 
(x, y, g(x, y)) (see Figure 7.5.2). Describing the surface by the equation $(x, y, z) = 
z — g(x, y) = 0, a normal vector N is V; that is, 


N2-98, 98 yk (6) 
X 


AScos0—- AA 


Figure 7.5.2 The area of a patch of area AS 
over a patch AA is AS = A4/cos0, where 6 
is the angle the unit normal n makes with k. 


b. EN 
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[see Example 4 of Section 7.3, or recall that the normal to a surface with equation 
g(x, y, z) = constant is given by Vg]. Thus, 


N-k 1 
INIL (@g/3x} + (9g/8yY. +1 
Substitution of this formula into equation (4) gives equation (5). Note that 
cos@ = n- k, where n = N/||N|| is the unit normal. Thus, we can write 
_ dx dy 
~ nek 


cos 0 


ds 


The result is, in fact, obvious geometrically, for if a small rectangle in the x y plane 
has area A A, then the area of the portion above it on the surface is AS = AA/cos@ 
(Figure 7.5.2). This intuitive approach can help us to remember formula (5) and to 
apply it in problems. 


SES] Compute ff. x dS, where S is the triangle with vertices (1, 0, 0), 
(0, 1, 0), (0, 0, 1) (see Figure 7.5.3). 


Figure 7.5.3 In computing a specific surface integral, 
one finds a formula for the unit normal n and computes 
the angle @ in preparation for formula (5). 


SOLUTION This surface is the plane described by the equation x + y +z = 1. 
Because the surface is a plane, the angle @ is constant and a unit normal vector is 
n = (1/73, 1/4/3, 1/3). Thus, cosó = n- k = 1/4/3, and by equation (5), 


[ [545 5 [[ xaxa, 


where D is the domain in the xy plane. But 
1 plex 1 NZ 
Vi [[ xixa] f xdydx = V3 | x(l—x)dx = ———. A 
D o Jo 0 


Integrals of functions over surfaces are useful for computing the mass of a surface 
when the mass density function m is known. The total mass of a surface with mass 
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density (per unit area) m is given by 
as) |f mo. v.s. (7) 
s 


ANVE] Let dS: D — R? bethe parametrization of the helicoid S = (D) 
of Example 2 of Section 7.4. Recall that ®(r, 0) = (r cos0, r sin0, 0), where 0 < 
0 < 2x, and 0 <r < 1. Suppose S has a mass density at (x, y, z) € S equal to twice 
the distance of (x, y, z) from the central axis (see Figure 7.4.2), that is, m(x, y, z) — 
2 x? + y? = 2r, in the cylindrical coordinate system. Find the total mass of the 
surface. 


SOLUTION Applying formula (7), 


ac» f [2/96 zas= ff aras= ff arin, x Toll dr d0. 
S D 


From Example 2 of Section 7.4, we see that || T, x Tg|| = 4/1 + r?. Thus, 


2n 1 
uc ff Vit ržarao = [ f 2ry1+r?dr do 
D 0 0 
mna i dE" 4n 
>f [Far] a= [ = (23? — 1)d9 = — (29? 1). a 
0 3 0 o 3 3 


EXERCISES 


1. Compute [f. xy dS, where S is the surface of the tetrahedron with sides z = 0, y = 0, 
x+z=l,andx=y. 


2. Evaluate Un xyz dS, where S is the triangle with vertices (1, 0, 0), (0, 2, 0), and (0, I, 1). 


3. Evaluate ff. z dS, where S is the upper hemisphere of radius a, that is, the set of (x, y, z) 


with z = ya? — x? — y?. 


4. Evaluate Fx + y 4- z) dS, where S is the boundary of the unit ball B; that is, S is the 
set of (x, y, z) with x? + y? + z? = 1. (Hint: Use the symmetry of the problem.) 


5. (a) Compute the area of the portion of the cone x? + y? = z? with z > 0 that is inside 
the sphere x? + y? +z? = 2Rz, where R is a positive constant. 
(b) What is the area of that portion of the sphere that is inside the cone? 
6. Verify that in spherical coordinates, on a sphere of radius R, 


Ty x Tol d$ d0 = R’ sing d do. 


7. Evaluate f. z dS, where S is the surface z = x? + y?, x? - y? <1. 
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8. Evaluate the surface integral ffy z? dS, where S is the boundary of the cube 
C =[-1, 1] x [-1, 1] x [—1, 1]. (Hint: Do each face separately and add the results.) 


9. Find the mass of a spherical surface S of radius R such that at each point (x, y, z) € S 
the mass density is equal to the distance of (x, y, z) to some fixed point (xo, yo, zo) € S. 


10. A metallic surface S is in the shape of a hemisphere z = y R? — x? — y?, where (x, y) 
satisfies 0 < x? + y? < R?. The mass density at (x, y, z) € S is given by m(x, y, z) = 
x? + y. Find the total mass of S. 


11. Let S be the sphere of radius R. 


(a) Argue by symmetry that 


flees fea [o 


(b) Use this fact and some clever thinking to evaluate, with very little computation, the 


integral 
Í I x^ ds. 
s 


(c) Does this help in Exercise 10? 
12. (a) Use Riemann sums to justify the formula 


1 

AG | fe» z)dS 
for the average value of f over the surface S. 

(b) In Example 3 of this section, show that the average of f(x, y, z) = z? over the 
sphere is 1/3. 

(c) Define the center of gravity (X, y, Z) of a surface S to be such that x, y, and Z are the 
average values of the x, y, and z coordinates on S. Show that the center of gravity of the 
triangle in Example 4 of this section is (4, 4, 1). 


13. Find the x, y, and z coordinates of the center of gravity of the octant of the solid sphere 
of radius R and centered at the origin determined by x > 0, y > 0,z > 0. (HINT: Write this 
octant as a parametrized surface—see Example 3 of this section and Exercise 12.) 

14. Find the z coordinate of the center of gravity (the average z coordinate) of the surface of 
a hemisphere (z < 0) with radius r (see Exercise 12). Argue by symmetry that the average x 


and y coordinates are both zero. 


15. Let 4: D C R? — R? be a parametrization of a surface S defined by 


x — x(u, v), y= y(u, v), z = z(u, v). 
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(a) Let 


39 (ax Gy 10m and 99 [8x ay dz 
au \ du’ du’ du àv \ dv’ av’ av)’ 


that is, 9®/du = T, and d®/dv = T,, and set 


aS p 
dv 


aep 
Qu 


ð ab 
F . y G 
ðu ðv | 


i 


Show that 
VEG — F? = |T, x Til, 


and that the surface area of S is 
A(S) — I VEG — F? dudv. 
D 


In this notation, how can we express ff, fdS for a general function of f? 

(b) What does the formula for A(S) become if the vectors ®/du and 804P/8v are 
orthogonal? 

(c) Use parts (a) and (b) to compute the surface area of a sphere of radius a. 


16. Dirichlet's functional for a parametrized surface b: D — R? is defined by'? 


rove fh UST po 


Use Exercise 15 to argue that the area A(®) < J(®) and equality holds if 


Qu 


aep 


*[-| 


ðv 


ab ab 
d P VERS ey 
a. QD 


Compare these equations with Exercise 15 and the remarks at the end of Section 7.4. A 
parametrization ® that satisfies conditions (a) and (b) is said to be conformal. 


17. Let D C R? and 4: D — R? be a smooth function (u, v) = (x(u, v), y(u, v)) 
satisfying conditions (a) and (b) of Exercise 16 and assume that 


ax ax 
a a 
det i i >0. 
ay  ðy 
ðu ðv 


'2Dirichlet’s functional played a major role in the mathematics of the nineteenth century. The mathematician Georg 
Friedrich Bernhard Riemann (1826-1866) used it to develop his complex function theory and to give a proof of the 
famous Riemann mapping theorem. Today it is still used extensively as a tool in the study of partial differential equations. 
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Show that x and y satisfy the Cauchy—Riemann equations 0x /0u = dy/dv, dx /dv = 
—dy/du. Conclude that V?® = 0 (i.e., each component of ® is harmonic). 


18. Let S be a sphere of radius r and p be a point inside or outside the sphere (but not on it). 
Show that 


1 is = [n if pis inside S 
s Ix — pll O° = lnr?Jd if pis outside S, 
where d is the distance from p to the center of the sphere and the integration is over the sphere. 


19. Find the surface area of that part of the cylinder x? + z? = a? that is inside the cylinder 
x? + y? = 2ay and also in the positive octant (x > 0, y > 0, z > 0). Assume a > 0. 


20. Let a surface S be defined implicitly by F(x, y, z) = 0 for (x, y) in a domain D of R?. 
Show that 


aF 
üz 


ALES DE) 5) + GE) o 


Compare with Exercise 18 of Section 7.4. 


7.6 Surface Integrals of Vector Fields 


The goal of this section is to develop the notion of the integral of a vector field over a 
surface. Recall that the definition of the line integral of a vector field was motivated 
by the fundamental physical notion of work. Similarly, there is a basic physical notion 
of flux that motivates the definition of the surface integral of a vector field. 

For example, if the vector field is the velocity field of a fluid (perhaps the velocity 
field of a flowing river), and one puts an imagined mathematical surface into the fluid, 
one can ask: “What is the rate at which fluid is crossing the given surface (measured 
in, say, cubic meters per second)?" The answer is given by the surface integral of the 
fluid velocity vector field over the surface. 

We shall come back to the physical interpretation shortly and reconcile it with 
the formal definition that we give first. 


Definition of the Surface Integral 


We now define the integral of a vector field, denoted F over a surface S. We first 
give the definition and later in this section give its physical interpretation. This can 
also be used as a motivation for the definition if the reader so desires. Also, we shall 
start with a parametrized surface ® and later study the question of independence of 
parametrization. 
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DEFINITION: The Surface Integral of Vector Fields Let F be a vector 
field defined on 5, the image of a parametrized surface ®. The surface integral 


of F over ®, denoted by 
| f F-dS, 
$ 
is defined by (see Figure 7.6.1)) 
II F-dS= If F-(T, x Ty) dudv. 
E: D 


F(T, X T) 
z4 Tu x Till 


d(D)- S 


x 


Figure 7.6.1 The geometric significance of F - (T, x T, ). 


Let D be the rectangle in the 0 plane defined by 


EXAMPLE 1 


0<0<2m, Oxózm, 
and let the surface S be defined by the parametrization 4: D — R? given by 
x = cos@ sing, y — sinO sing, z — cos. 


(Thus, 0 and $ are the angles of spherical coordinates, and S is the unit sphere 
parametrized by ®.) Let r be the position vector r(x, y, z) = xi + yj + zk. Compute 
Sfor- aS. 


SOLUTION First we find 
To = (—sin $ sin 0)i + (sing cos 0)j 
Ty = (cos 0 cos $)i + (sin cos $)j — (sin $)k, 
and hence 
Te x Ty = (—sin?$ cos 0)i — (sin? $ sin6)j — (sing cos ¢)k. 
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Then we evaluate 


r- (To x Ty) = (xi + yj +2K) - (To x Ty) 
= [(cos 0 sin $)i + (sin 0 sin $)j + (cos $)k] 
* (—sin $)[(sin ¢ cos 0)i + (sin $ sin 0)j + (cos $)k] 
= (—sin ¢)(sin? $ cos? 0 + sin? $ sin? 0 + cos? $) = —sin $. 


2x 
I ras - ff —sin$ d$ d0 = (—2)d0 = —4n. A 
E D 0 


An analogy can be drawn between the surface integral ffp F - dS and the line integral 
Je F - ds. Recall that the line integral is an oriented integral. We needed the notion of 
orientation ofa curve to extend the definition of f. F - ds to line integrals fọ F - ds over 
oriented curves. We extend the definition of ff, F - dS to oriented surfaces in a similar 
fashion; that is, given a surface S parametrized by a mapping ®, we want to define 
Jf; F-aS = ff, F - aS and show that it is independent of the parametrization, except 
possibly for the sign. To accomplish this, we need the notion of orientation ofa surface. 


Thus, 


Orientation 


DEFINITION: Oriented Surfaces An oriented surface is a two-sided sur- 
face with one side specified as the outside or positive side; we call the other 
side the inside or negative side.'> At cach point (x, y, z) € S there are two unit 


normal vectors n; and nz, where n; = —n» (see Figure 7.6.2). Each of these two 
normals can be associated with one side of the surface. Thus, to specify a side 
of a surface S, at each point we choose a unit normal vector n that points away 
from the positive side of S at that point. 


Figure 7.6.2 The two possible unit normals to a 
surface at a point. 


'3 We use the term "side" in an intuitive sense. This concept can be developed rigorously, but this will not be done here. 
Also, the choice of the side to be named the “outside” is often dictated by the surface itself, as, for example, is the case 
with a sphere. In other cases, the naming is somewhat arbitrary (see the piece of surface depicted in Figure 7.6.2, for 
instance). 
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This definition assumes that our surface does have two sides. In fact, this is nec- 
essary, because there are examples of surfaces with only one side! The first known 
example of such a surface was the Móbius strip (named after the German mathemati- 
cian and astronomer A. F. Móbius, who, along with the mathematician J. B. Listing, 
discovered it in 1858). Pictures of such a surface are given in Figures 7.6.3 and 7.6.4. 
At each point of M there are two unit normals, n; and n;. However, n; does not 
determine a unique side of M, and neither does n». To see this intuitively, we can 
slide n; around the closed curve C (Figure 7.6.3). When n; returns to a fixed point p 
on C it will coincide with n;, showing that both n; and n? point away from the same 
side of M and, consequently, that M has only one side. 


Figure 7.6.3 The Möbius strip: Slide n; around C once; 
when n; returns to its initial point, it will coincide with 
n; = —m. 


Figure 7.6.4 is a Móbius strip as drawn by the well-known twentieth-century 
mathematician and artist M. C. Escher. It depicts ants crawling along the Móbius 
band. After one trip around the band (without crossing an edge) they end up on the 
“opposite side” of the surface. 


Figure 7.6.4 Ants walking on a Mobius strip. 


Let &: D — R? be a parametrization of an oriented surface S and suppose 
S is regular at ®(uo, vo), (uo, vo) € D; thus, the vector (Tu, x T,,)/l Tu, x Troll 
is defined. If n(®(uo, vo)) denotes the unit normal to S at (uo, vo), it follows 
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that 
(Tuy x T&)/WITu, x Tul = 3n(S(uo, vo)). 


The parametrization ® is said to be orientation-preserving if we have the + 
sign; that is, if (T, x T,)/|T, x T,|| = n(®(u, v)) at all (u, v) € D for which S is 
smooth at (u, v). In other words, ® is orientation-preserving if the vector T, x T, 
points to the outside of the surface. If T, x T, points to the inside of the surface at all 
points (u, v) € D for which S is regular at ®(u, v), then ® is said to be orientation- 
reversing. Using the preceding notation, this condition corresponds to the choice 
(T, x T,)/IT, x T, = —n(@(u, v)). 

It follows from this discussion that the Móbius band M cannot be parametrized 
by a single parametrization for which n = T, x T, 4 0 and n is continuous over 
the whole surface!^ (if there were such a parameterization, then M would indeed 
have two sides, one determined by n and one determined by —n). The sphere in 
Example 1 can be parametrized by a single parametrization, but not by one that is 
everywhere one-to-one—see the discussion at the beginning of Section 7.4. 

Thus, any one-to-one parametrized surface for which T, x Ty never vanishes can 
be considered as an oriented surface with a positive side determined by the direction 
of Tu: X Ty. 


| We can give the unit sphere x? + y? + z? = 1 in R? (gute 7.6.5) 
an orientation by selecting the unit vector n(x, y, z) = r, where r = xi + yj + zk, 
which points to the outside of the surface. This choice corresponds to our intuitive 
notion of outside for the sphere. 


z 


Figure 7.6.5 The unit sphere oriented by its 
y outward normal n. 


Now that the sphere S is an oriented surface, consider the parametrization ® of 
S given in Example 1. The cross product of the tangent vectors Ty and T; —that is, 


V There is a single parametrization obtained by cutting a strip of paper, twisting it, and gluing the ends, but it produces a 
discontinuous n on the surface. 
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a normal to S is given by 

(—sin $)[(cos 0 sin $)i + (sin 0 sin $)j + (cos $)k] = —rsing. 
Because —sin $ < 0 for 0 < $ < x, this normal vector points inward from the sphere. 
Thus, the given parametrization ® is orientation-reversing. By swapping the order of 
0 and $, we would get an orientation-preserving parametrization. A 


Orientation and the Vector Surface Element of a Sphere 


Consider the sphere of radius R, namely, x? + y? + z? = R?. It is standard practice 
to orient the sphere with the outward unit normal. In terms of the position vector r = 
xi + yj + zk, the outward unit normal is given by 

r 


n=—. 
R 


The order of spherical coordinates that goes along with this orientation, as is evident 
from Example 2, is given by the order ($, 6). The computation in Example 2 shows 
that the surface-area element is then given by 


dS = n- (T; x To)d$ d0 = rRsinó dọ d0 = nR? sing do do. 


The Orientation of Graphs 


The next example discusses the orientation conventions for graphs, We shall compute 
the area element on graphs later in this section. 


AAA] Let S be a surface described by z = g(x, y). As in equation (6), 
Section 7.5, there are two unit normal vectors to S at (xo, yo, g(Xo, yo)), namely, +n, 
where 


a m: J 
-E (x, yo)i — E (xo, yo)j +k 
Ox oy 


ag 2 ag 2 f 
[6v] + [aes] $T 
x ay 


We can orient all such surfaces by taking the positive side of S to be the side 
away from which n points (Figure 7.6.6). Thus, the positive side of such a surface is 


outside 


Figure 7.6.6 n points away from the outside 


inside 
of the surface. 
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determined by the unit normal n with positive k component—that is, it is upward- 
pointing. If we parametrize this surface by ®(w, v) = (u, v, g(u, v)), then ® will be 
orientation-preserving. A 


Independence of Parametrization 


We now state without proof a theorem showing that the integral over an oriented 
surface is independent of the parametrization. The proof of this theorem is analogous 
to that of Theorem 1 (Section 7.2); the heart of the proof is again the change of 
variables formula—this time applied to double integrals. 


THEOREM 4: Independence of Surface Integrals on Parame- 
trizations Let S be an oriented surface and let ©, and ®» be two regular 
orientation-preserving parametrizations, with F a continuous vector field defined 


on S. Then 
Il, ras - [f F-dS. 
$, ©, 


If $; is orientation-preserving and 4 orientation-reversing, then 


f[ r-- fff rs 


If f is a real-valued continuous function defined on S, and if ®; and ® are 
parametrizations of S, then 


If, fas- fff fds. 
i à 


Note that if f = 1, we obtain 


a= fff as= ff. as. 


thus showing that area is independent of parametrization. 
We can therefore unambiguously use the notation 


[[ 7-557 [[ rs 


(ora sum of such integrals, if S is a union of parametrized surfaces that intersect only 
along their boundary curves) where ® is an orientation-preserving parametrization. 
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Theorem 4 guarantees that the value of the integral does not depend on the selection 
of 9. 


Relation with Scalar Integrals 


Recall from formula (1) of Section 7.2 that a line integral ih F - ds can be thought of 
as the path integral of the tangential component of F along c (although for the case 
in which c intersects itself, the integral obtained is technically not a path integral). 
A similar situation holds for surface integrals, because we are assuming that the 
mappings € defining the surface S are one-to-one except perhaps on the boundary of 
D, which can be ignored for the purposes of integration. Thus, in defining integrals 
over surfaces, we assume in this book that the surfaces are nonintersecting. 

For an oriented smooth surface S and an orientation-preserving parametrization 
® of S, we can express ff, s F-4S as an integral of a real-valued function f over the 
surface. Let n = (T, x T,)/||T, x T,|| be the unit normal pointing to the outside of 
S. Then 


ffas- ff, r-as= fff F- (T, x T,)du dv 
= ff e GE it rae 
= ff em. x TM dudo = ff mas - ff ras. 


where f = F - n. We have thus proved the following theorem. 


THEOREM 5 /f. F-dS, the surface integral of F over S, is equal to the integral 
of the normal component of F over the surface. In short, 


J| -as= [[r-nes. 


The observation in Theorem 5 can often save computational effort, as Example 4 
demonstrates. 


The Physical Interpretation of Surface Integrals 


The geometric and physical significance of the surface integral can be understood by 
expressing it as a limit of Riemann sums. For simplicity, we assume D is a rectangle. 
Fix a parametrization ® of S that preserves orientation and partition the region D 
into n? pieces Dy,0 <i <n—-1,0< j <n—1. Welet Au denote the length ofthe 
horizontal side of D;; and Av denote the length of the vertical side of D;;. Let (u, v) 
be a point in D;;, and (x, y, z) = (u, v) the corresponding point on the surface. We 
consider the parallelogram with sides Au T, and Av T, lying in the plane tangent to 
S at (x, y, z) and the parallelepiped formed by F, Au T,, and Av T,. The volume of 
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the parallelepiped is the absolute value of the triple product 
F: (Au T, x AvT,) =F- (T, x T,) Au Av. 
The vector T, x T, is normal to the surface at (x, y, z) and points away from the out- 


side of the surface. Thus, the number F - (T, x T,) is positive when the parallelepiped 
lies on the outside of the surface (Figure 7.6.7). 


x 


Figure 7.6.7 F- (T, x T,) > 0 when the parallelpiped formed by Av T,, Au Tu, 
and F lies to the “outside” of the surface S. 


In general, the parallelepiped lies on that side of the surface away from which F 
is pointing. If we think of F as the velocity field of a fluid, F(x, y, z) is pointing in 
the direction in which fluid is moving across the surface near (x, y, z). Moreover, the 
number 


IF - (T, Au x T, Av)| 


measures the amount of fluid that passes through the tangent parallelogram per unit 
time. Because the sign of F - (Au T, x Av T,) is positive if the vector F is pointing 
outward at (x, y, z) and negative if F is pointing inward, YF (T, x T) Au Av 
is an approximate measure of the net quantity of fluid to flow outward across the 
surface per unit time. (Remember that “outward” or “inward” depends on our choice 
of parametrization. Figure 7.6.8 illustrates F directed outward and inward, given T,, 
and T,.) Hence, the integral ff. s F-4S is the net quantity of fluid to flow across the 
surface per unit time, that is, the rate of fluid flow. This integral is also called the flux 
of F across the surface. 

In the case where F represents an electric or a magnetic field, [fọ F -dS is also 
commonly known as the flux. The reader may be familiar with physical laws (such 
as Faraday S law) that relate flux of a vector field to a circulation (or current) in a 
bounding loop. This is the historical and physical basis of Stokes's theorem, which 
we will discuss in Section 8.2. The corresponding principle in fluid mechanics is 
called Kelvin 5 circulation theorem. 
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Figure 7.6.8 When F- (T, x Tj) > 0 
(left), F points outward; when 

F- (T, x T,) < 0 (right), F points 
inward. 


Surface integrals also apply to the study of heat flow. Let T (x, y, z) be the temper- 
ature at a point (x, y, z) € W C IO, where W is some region and T is a C! function. 
Then 


T T 
sita Ei 
ax oy 


oT 
—k 
"T üz 
represents the temperature gradient, and heat “flows” with the vector field —k V T =F, 


where k is a positive constant (see Section 8.5). Therefore, ff, ‘s F -dS is the total rate 
of heat flow or flux across the surface S. 


955") Suppose a temperature function is given in R? by the formula 
T(x, y, z) = x? + y? -- z^, and let S be the unit sphere x? + y? +z? = 1 oriented 
with the outward normal (see Example 2). Find the heat flux across the surface S if 
k=l: 


SOLUTION We have 
F--VT(x, y, z) = —2xi — 2yj — 2zk. 


On 5, the vector n(x, y, z) = xi + yj + zk is the unit “outward” normal to S at 
(x,y,z), and f(x, y, z) = F - n = —2x? — 2y? — 27? = —2 is the normal component 
of F. From Theorem 5 we can see that the surface integral of F is equal to the integral 
of its normal component f = F - n over S. Thus, 


[5-57 ff ras= 2 [fas =—24(8) = -26 = -&. 


The flux of heat is directed toward the center of the sphere (why toward?). Clearly, 
our observation that ff F- dS = ff, f dS has saved us considerable computational 
time. 

In this example, F(x, y, z) = —2xi — 2yj — 2zk could also represent an electric 
field, in which case Is F- dS = —8z would be the electric flux across $. A 
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Gauss’ Law There is an important physical law, due to the great 
mathematician and physicist K. F. Gauss, that relates the flux of an electric field E 
over a “closed” surface S (for example, a sphere or an ellipsoid) to the net charge Q 
enclosed by the surface, namely (in suitable units), 


[[r-55-0 (1) 


(see Figure 7.6.9). Gauss' law will be discussed in detail in Chapter 8. This law is 
analogous to Ampére's law (see Example 12, Section 7.2). 


S = closed surface 


Figure 7.6.9 Gauss’ law: ff, E-dS = Q, 


where Q is the net charge inside S. 
E = electric field 


Suppose that E — En; that is, E is a constant scalar multiple of the unit normal 
to S. Then Gauss' law, equation (1) in Example 5, becomes 


[[e-as= [[eas=e [[ 45-0 


because E = E - n. Thus, 


E= 2, (2) 
A(S) 
In the case where S is the sphere of radius R, equation (2) becomes 
Q 
E=—, 3 
4nR? 9 


(see Figure 7.6.10). 

Now suppose that E arises from an isolated point charge, Q. From symmetry it 
is reasonable that E — En, where n is the unit normal to any sphere centered at Q. 
Hence, equation (3) holds. Consider a second point charge, Qo, located at a distance 
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Figure 7.6.10 The field E due to a point charge Q is 


Q d E = Qn/AzR?. 
x^ Fi 


R from Q. The force F that acts on this second charge, Qo, is given by 


200 


F = EQ) = EQ = anm 


If F is the magnitude of F, we have 


_ 20o 
abs 4nR2’ 


which is Coulomb's law for the force between two point charges. A 


Surface Integrals Over Graphs 


Finally, let us derive the surface-integral formulas for vector fields F over surfaces S 
that are graphs of functions. Consider the surface S described by z — g(x, y), where 
(x, y) € D, where S is oriented with the upward pointing unit normal: 


og V fagy? 
= L 1 
E) t E: 
We have seen that we can parametrize S by 4: D — IR? given by (x, y) = 


(x, y, g(x, y)). In this case, Is F - dS can be written in a particularly simple form. 
We have 


8g 8g 
=i k, T,=) k. 
ita y p 


'SSometimes one sees the formula F = (1/47:59)Q Qo/ R?. The extra constant £o appears when MKS units are used for 
measuring charge. We are using CGS, or Gaussian, units. 
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Thus, T, x T, = —(8g/8x)i — (8g/8y)j +k. If F = Fi + Fj + Fk is a continu- 
ous vector field, then we get 


The Surface Integral of a Vector Field Over a Graph S 


[| r5 ff r-a x Ty) dx dy 


ff [^ C) 


EXAMPLE 6 The equations 


gi 12, x^-y < 25 
describe a disk of radius 5 lying in the plane z = 12. Suppose r is the vector field 
r(x, y, z) = xi + yj + zk. 
Compute ffor- dS. 
SOLUTION We shall do this in three ways. First, we have dz/dx = 0z/8y = 0, 
because z — 12 is constant on the disk, so 


r(x, y, z): (T; x Ty) = r(x, y, z)» (i x j) = r(x, y, z)- Ko z. 


Using the original definition at the beginning of this section, the integral becomes 


Il rs ff inm - ff 12 dx dy = 12(area of D) = 3007. 
s D D 


A second solution: Because the disk is parallel to the xy plane, the outward unit 
normal is k. Hence, n(x, y, z) = k and r-n = z. However, ||T, x T,|| = ||kl| = 1, 
and so we know from the discussion preceding Theorem 5 that 


f [r5 [ [rns ffzas= ff 12dxdy = 5007. 
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Third, we may solve this problem by using formula (4) directly, with g(x, y) — 
and D the disk x? + y? < 25: 


[[ i57 [[e omes = 12(area of D) = 3007. 4 
Š D 


Summary: Formulas for Surface Integrals 


1. Parametrized Surface: (1, v) 


(a) Integral of a scalar function f: 


f[ res 2 If, fi, v), x Toll du dv 


(b) Scalar surface element: 
S = ||T, x T,| du dv 


(c) Integral of a vector field F: 


[[ r5 ff rm on 
s D 


(d) Vector surface element: 
dS = (T, x T,)dudv=ndS 


2. Graph:z = g(x, y) 


(a) Integral of a scalar function f: 


LOY SCY) y 
If oe || eee an cos 6 


(b) Scalar surface element: 


2 2 
asa Z a (EV (EN T 
cos@ ox oy 


where cos 0 = n - k, and n is a unit normal vector to the surface. 
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(c) Integral of a vector field F: 


ag ag 
F. F, F, | 
I as fL iE - n A) dry 


(d) Vector surface element: 


ag. a 
dS=n-dS = ( -Ei Ej +k) ax ay 
ax oy 


3. Sphere: x? + y? +z? = R? 


(a) Scalar surface element: 
dS = R? sing dọ d0 
(b) Vector surface element: 


dS = (xi + yj +zk)Rsing dọ d0 = rRsing dọ d0 = nR? sing dọ dO 


EXERCISES 


1. Let the temperature of a point in IR? be given by T(x, y, z) = 3x? + 3z?. Compute the 
heat flux across the surface x? +z? = 2,0 < y < 2, ifk = 1. 


2. Compute the heat flux across the unit sphere S if T(x, y, z) — x. Can you interpret your 
answer physically? 


3. Let S be the closed surface that consists of the hemisphere x? + y? + z? = 1,z > 0, 
and its base x? 4- y? « 1, z — 0. Let E be the electric field defined by E(x, y.z)-— 
2xi 4- 2yj + 2zk. Find the electric flux across S. (HINT: Break S into two pieces Sı and 
S, and evaluate f/;, E -dS and ffy, E -+ dS separately.) 


4. Let the velocity field of a fluid be described by F = „/yi (measured in meters per 
second). Compute how many cubic meters of fluid per second are crossing the surface 
e4+z2=10<y<1,0<x<1. 


5. Evaluate ff. (V x F)- dS, where S is the surface x? + y? + 3z? = 1, z < 0, and F is the 
vector field F = yi — xj + zx?y?k. (Let n, the unit normal, be upward pointing.) 


6. Evaluate /[,(V x F)- dS where F = (x? + y — 4)i + 3xyj + (2xz + z?)k and S is the 
surface x? + y? +z? = 16, z > 0. (Let n, the unit normal, be upward pointing.) 
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7. Calculate the integral Jis F- dS, where S is the entire surface of the solid half ball 
x? +y? +2 < 1,z > 0, and F = (x + 3y5)i + (y + 10xz)j + (z — xy)k. (Let S be oriented 
by the outward pointing normal.) 


8.* A restaurant is being built on the side of a mountain. The architect's plans are shown in 
Figure 7.6.11. 


— Restaurant x24 y? — AR 2 


Side view 


Top view 


Figure 7.6.11 Restaurant plans. 


(a) The vertical curved wall of the restaurant is to be built of glass. What will be the 
surface area of this wall? 

(b) To be large enough to be profitable, the consulting engineer informs the developer 
that the volume of the interior must exceed R*/2. For what R does the proposed structure 
satisfy this requirement? 

(c) During a typical summer day, the environs of the restaurant are subject to a 
temperature field given by 


T(x, y, z) = 3x? + (y — R + 162°. 
A heat flux density V = —k VT (k is a constant depending on the grade of insulation to be 
used) through all sides of the restaurant (including the top and the contact with the hill) 
produces a heat flux. What is this total heat flux? (Your answer will depend on R and k.) 


9. Find the flux of the vector field V(x, y, z) = 3xy?i + 3x?yj + 2°k out of the unit sphere. 


10. Evaluate the surface integral ff, F - nd A, where F(x, y, z) = i + j+2(x? + ?y'k and S 
is the surface of the cylinder x? + y? < 1,0 <z <1. 


*The solution to this problem may be somewhat time-consuming. 
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11. Let S be the surface of the unit sphere. Let F be a vector field and F, its radial 


component. Prove that 
2n m 
ffr dS =f f F, sinġ d$ dé. 
S 6-0 Jġ=0 


What is the corresponding formula for real-valued functions f? 


12. Prove the following mean-value theorem for surface integrals: If F is a continuous vector 
field, then 


i] T F- ndS = [F(Q)- n(Q]AG) 


for some point Q € S, where A(S) is the area of S. [HINT: Prove it for real functions first, by 
reducing the problem to one of a double integral: Show that if g > 0, then 


Jf n 6 ff eas 


for some Q € D (do it by considering (ffp fg dA)/( ffp g dA) and using the intermediate 
value theorem).] 


13. Work out a formula like that in Exercise 11 for integration over the surface of a cylinder. 


14. Let S bea surface in R? that is actually a subset D of the xy plane. Show that the integral 
of a scalar function f(x, y, z) over S reduces to the double integral of f(x, y, z) over D. 
What does the surface integral of a vector field over S become? (Make sure your answer is 
compatible with Example 6.) 


15. Let the velocity field of a fluid be described by F = i + xj + zk (measured in meters per 
second). Compute how many cubic meters of fluid per second are crossing the surface 
described by x? Ty +2=1,z2>0. 


16. (a) A uniform fluid that flows vertically downward (heavy rain) is described by the 
vector field F(x, y, z) = (0, 0, —1). Find the total flux through the cone z = (x? + y?)!/?, 
x+y? <1. 

(b) The rain is driven sideways by a strong wind so that it falls at a 45° angle, and it is 
described by F(x, y, z) = —(V2/2, 0, /2/2). Now what is the flux through the cone? 


17. Fora > 0, b > 0,c > 0, let S be the upper half ellipsoid 


2 2 


Y a 
itg lzz0y, 


x 
a 


S [esa 


with orientation determined by the upward normal. Compute ff, F - dS where F(x, y, z) = 
(x3, 0, 0). 
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18. If S is the upper hemisphere ((x, y, z) | x? + y? +z? = 1, z > 0} oriented by the normal 
pointing out of the sphere, compute Is F - dS for parts (a) and (b). 


(a) Fœ, y,z)=xi+ yj 

(b) FQ, y, z) = yi + xj 

(c) For each of these vector fields, compute ff. (V x F)- dS and fo F- ds where C is 
the unit circle in the xy plane traversed in the counterclockwise direction (as viewed from the 
positive z axis). (Notice that C is the boundary of S. The phenomenon illustrated here will be 
studied more thoroughly in the next chapter, using Stokes’ theorem.) 


7.7 Applications to Differential Geometry, Physics, and 
Forms of Life* 


In the first half of the nineteenth century, the great German mathematician Karl 
Friedrich Gauss developed a theory of curved surfaces in R?. More than a century 
earlier, Isaac Newton had defined a measure of the curvature of a space curve, and 
Gauss was able to find extensions of this idea of curvature that would apply to surfaces. 
In so doing, Gauss made several remarkable discoveries. 


Curvature of Surfaces 


For paths c: [a, b] — IR? that have unit speed—that is, ||e/(t)|| = 1—the curvature x 
ofthe image curve «x (e(t)) at the point c() is defined to be the length ofthe acceleration 
vector. That is, ||e”(¢)|| = x (e(t)). For paths c in space, the curvature is a true measure 
of the curvature of the geometric image curve C. As we saw at the end of Section 
7.1, the “total curvature" f « ds over C has “topological” implications. The same, 
and even more, will hold for Gauss' definition of the total curvature of a surface. We 
begin with some definitions. 
Let &: D — R? be a smooth parametrized surface. Then, as we know, 


o 
T, -2 


_ 80 
“au 


d T,-— 
an bcm 


are tangent vectors to the image surface S = ®(D) at the point (u, v). We will also 
assume that there is a well-defined normal vector; that is, we assume the surface is 
regular: T, x T, 40. 

Let 


0d 2 


ðu 


0o 
dv 


E= 


, 


2 

ab 09 

pa c=| 
ðu ðv 


In Exercise 15 of Section 7.5, we saw that 


IIT, x T.I? = EG — F’. 


*This section can be skipped on a first reading without loss of continuity. 
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For notational reasons, we denote EG — F? by W. Furthermore, we let 


O xT, xT, 
IT, x Toll JW 


denote the unit normal vector to the image surface at p = (u, v). Next we will define 
two new measures of the curvature of a surface at p—the “Gauss curvature,” K (p), 
and the “mean curvature,” H(p). Both of these curvatures have deep connections to 
the curvature of space curves, which illuminate the meaning of their definitions, but 
we do not explore these here. 

To define these two curvatures, we first define three new functions £, m,n on S 
as follows: 


2 


LP) = Nw, v) 5 = NG, v) Bin 


du 
2 
m(p) = N(u, v)- 2. — Nui, v): Bye a) 
dudv 
2 
n(p) = N(u, v)- = = N(u, v): By. 
Qv 


The Gauss curvature K( p) of S at p is given by 


én — m? 
K = " 2 
(p) y Q) 
and the mean curvature H(p) of S at p is defined by!6 
G£ -- En — 2Fm 
Hp = E 
(p) 2W ; (3) 


where the right-hand sides of both expressions are calculated at the point p = ®(u, v). 


Planes Have Zero Curvature Let (uw, v) = ou + v + v, 
(u, v) € R2, where a, Ø, ^y are vectors in IR?. According to Example 1 of Section 
7.3, this determines a parametrized plane in IR*. Show that at every point, both the 
Gauss and mean curvatures are zero, and hence K and H vanish identically. 


SOLUTION Because ®,,, = ®,, = ®,, — 0, the functions £,m,n vanish 
everywhere, and so do H and K. Thus, a plane has “zero” curvature. Hence, at 
least in this example, we ought to be convinced that H and K actually do measure 
the flatness of the plane. Conversely, one can show that if H and K vanish identically, 
then S is part of a plane (see Exercise 10). A 


'éTechnically speaking, K( p) and H( p) could, in principle, depend on the parametrization ® of S, but one can show that 
they are, in fact, independent of $. 
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>. i woe" Curvature of a Hemisphere Let 
P(u, v) = (u, v, g(u, v), 
where g(u, v) = ~ R? — u? — v? is a parametrization of the “upper hemisphere" of 


radius R. Show that the Gauss curvature at every point is 1/ R? and the mean curvature 
is 1/R. 


SOLUTION We must first calculate the following quantities: 
Tu, T», T, x Ty, Bun, Buv, Buv, E, G, F, £, m,n. 


First of all, we have 


6,28, Sin 
JR ee 
v 
$,—T 


From formula (2) in Section 7.3, we have 


ag. 0g 


T,xT, = i - Si+k 
u PR v š 
= ETE mew 
Therefore, 
R? — v? 
= 2 as = 
E = |i =1+——3 i = Powe 
R? — u 
g 
G=|® ll Rouy? 
P=6,.3,-—” 


m 42 —y2 * 
From Exercise 15 of Section 7.5, we know that 
(R? — v?)(R? — u?) — u?v? 
(R? —y2— vy 
R — R?u? — R2y2 R? 
= = W. 
(R? — u? — v2} (R2 — 2 — v2) 


IT, x T,|? = EG— F? = 
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Now a direct calculation shows that 


R? — v? 
Duy 
(R? ee v2)3/2 
pa E 
by, = —— —— 
(R? -a-yyn 
o, = = 


(R2 — u? — vp? k. 


Furthermore, 


T, x T, _ Ty x T, 
IT, x Tl MW 
JR? — u? — v2 u , v , 
. F j+k 


R 


R 
Thus, 
1 R? — v? 
nni (maa) 
1 R? — u? 
MEN Bie dte) 
l uv 
iN B d rm 
Therefore, 


1 ((R? — v?)(R? — u?) — iv? 
R2 

1 
R? — u? — v?’ 


Dividing this by W yields K = 1/ R?. Thus, the Gauss curvature does not change 
from point to point on the hemisphere; that is, it is constant. This conforms to our 
intuition that the sphere is perfectly symmetrical and that its curvature is everywhere 
equal.. Hence, the mean curvature should also be constant. This is verified by the 
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following calculation: 


H= G£ -- En—2Fm 
2W 


1 R? — u? 1 R? — x? 
2W | (R2 — u? — v?) R \ R? — u? -— v 

' R? —v? 1 R? — u? 2 uv? 
(R2 =u? — v?) R\R2— Ww —v? (R? — u? — v2)? 


1 R 1 " 
W | R2-—u?-— v? R` 


Surfaces of Constant Curvature 


Surfaces ofconstant Gauss and mean curvature are of great interest to mathematicians. 
It was known in the nineteenth century that the only closed and bounded smooth 
surfaces with “no boundary” and with constant Gauss curvature were spheres. In the 
twentieth century, the Russian mathematician Alexandrov showed that the only closed 
and bounded smooth surfaces without a boundary that do not intersect themselves 
and that have constant mean curvature must also be spheres. Mathematicians believed 
that Alexandrov's result held even if the surface was allowed to intersect itself, but no 
one could find a proof. In 1984, Professor Henry Wente (Toledo, Ohio) startled the 
world by finding a self-intersecting torus of constant mean curvature. 

Surfaces of constant mean curvature are physically relevant and occur throughout 
nature. Soap bubble formations have constant nonzero mean curvature (see Figure 
7.7.1), and soap film formations (containing no air) have constant mean curvature 
zero (see Figures 7.7.2 and 7.7.3). 


Figure 7.7.1 Soap bubble formation; H = constant. 
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Figure 7.7.2 A helicoid, Figure 7.7.3 A soap film, H — 0, spanning two 
H=0. circular wires; this one is the catenoid. 


In the early nineteenth century, the French mathematician Delaunay discovered 
all surfaces of revolution that have constant mean curvature. They are the cylinder, 
sphere, catenoid, unduloid, and nodoid. The catenoid exists as a soap film surface 
spanning two circular contours. 


Optimal Shapes in Nature 


Throughout the ages, people have speculated on why things are shaped the way they 
are. Why are the earth and the stars “round” and not cubical? Why are life forms 
shaped the way they are? 

In 1917, the British natural philosopher D’ Arcy Thompson published a provoca- 
tive work entitled On Growth and Form, in which he investigated the forces behind 
the creation of living forms in nature. He wrote: 


In an organism, great or small, it is not merely the nature of the motions of 
the living substance which we must interpret in terms of force (according 
to kinetics), but also the conformation of the organism itself, whose 
permanence or equilibrium is explained by the interaction or balance of 
forces, as described in statics. 


Surprisingly, Thompson discovered a// of Delaunay's surfaces in the form of unicel- 
lular organisms (see Figure 7.7.4). The constant mean curvature of these organisms 
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can be explained by minimum principles similar to those described in the Historical 
Note in Section 3.3. In 1952, Watson and Crick determined that the structure of DNA 
is that of a double helix, a discovery that set the stage for the genetic revolution. We 
know from soap films, as in Figure 7.2.2, that nature likes helicoid forms, and nature 
tends to repeat patterns. A better understanding of the scientific principles underly- 
ing life may ultimately help mathematics play a more prominent role in this area of 
theoretical biology. 


Figure 7.7.4 Surfaces of revolution of 
constant mean curvature as unicellulars. 


Curvature and Physics 


The theory of curved surfaces, initiated by Gauss, has had a profound effect on 
physics. Gauss realized that the Gauss curvature K of a surface depended only on the 
measure of distance on the surface itself; that is, curvature was intrinsic to the surface. 
This is not true of the mean curvature H. Thus, beings “living” on the surface would 
be able to tell that the surface was curving, without any reference to an "external" 
world. Gauss himself found this mathematical result to be so striking that he named it 
theorema egregium, or “remarkable theorem.” Gauss’ theory was generalized by his 
student Bernhard Riemann to n-dimensional surfaces for which one could describe a 
notion of curvature. 

Recall that Newton created the idea of a gravitational force acting over vast 
galactic distances, pulling galaxies together as well as pushing them apart (see 
Figure 7.7.5). In the early 1900s, Albert Einstein used Riemann's ideas to develop 
the general theory of relativity, a theory of gravitation that eliminated the need to 
consider forces (as Newton did) acting over great distances. Einstein's theory 
explained the bending of light by the sun, black holes, the expansion of the uni- 
verse, the formation of galaxies, and the Big Bang itself. For most applications, 
including the dynamics of our solar system, Newton's theory suffices and is com- 
monly used today by NASA to plan space missions, as we saw in Section 4.1. But for 
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Figure 7.7.5 The Andromeda Galaxy. It will collide with the Milky Way in roughly 
2 billion years. 


cosmological applications on the grand scale, Einstein's theory replaced that of Isaac 
Newton, published in his Principia in 1687. 

As a testament to his genius, and despite the astounding success of this theory, 
Newton was nevertheless disturbed by questions about how this gravitational force 
acted. He could give no other explanation than to say, “I have not been able to de- 
duce from phenomena the reason for these properties of gravitation, and I do not 
invent hypotheses; for anything which cannot be deduced from phenomena should 
be called an hypothesis." Moreover, in a letter to his friend, Richard Bentley, Newton 
wrote: 


That gravity should be innate, inherent and essential to matter, so that one 
body may act upon another at a distance, through which their action may be 
conveyed from one to another, is to me so great an absurdity that I believe 
no man, who has in philosophical matters a competent faculty of thinking, 
can ever fall into it. 


Newton coined the term action at a distance (which means "force acting at a 
distance") to describe the mysterious effect of gravitation over large distances. This 
effect is as difficult to understand today as it was in Newton's time. 

Johann Bernoulli found it difficult to believe in the concept of a force that acts 
through a vacuum of space over distances of even hundreds of millions of miles. 
He viewed this force as a concept revolting to minds unaccustomed to accepting 
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Figure 7.7.6 Albert Einstein (1879— 
1955) at his desk in the Patent Office, 
Bern, 1905. 


any principle in physics, save those that are incontestable and evident. Additionally, 
Leibniz considered gravitation to be an incorporeal and inexplicable power, philo- 
sophically void. 

It was perhaps Albert Einstein's greatest inspiration (see Figure 7.7.6) to re- 
place Newton's model of gravitation with a model that would have thrilled the early 
Greeks—a geometric model of gravitation. In Einstein's theory, the concept of a force 
acting through great distances has been replaced by the curvature of a space-time!” 
world. As the quote at the beginning of the chapter illustrates, W. K. Clifford had 
a premonition of events to come! In order to elucidate Einstein's scheme, we shall 
present an oversimplified model that conveys some of his basic ideas. 

We represent space by a surface that we imagine as an originally flat trampoline 
(the vacuum state), which is at some point strongly deformed by the weight of a 
gigantic steel ball (the sun). A tiny steel ball rolling on the trampoline is our planet 
Earth (see Figure 7.7.7) 

If we roll the small steel ball across the flat trampoline, it will travel in a straight- 
line path. However, if we now place the gigantic steel ball in the center, it will cause 
the trampoline to bend, or “curve,” even “far away" from the large ball. If we then 
give our little ball a push, it will no longer travel in a straight line but in a curved 


V Space-time is locally like R* with three space coordinates and one time coordinate. 


7.7 Applications to Differential Geometry, Physics, and Forms of Life 509 


Figure 7.7.7 (a) A particle on a taut trampoline moves in a straight line. (b) A 
heavy steel ball distorts the trampoline. (c) A particle on the distorted trampoline 
follows a curved path. 


path. The big ball affects the trajectory of the little ball by curving the space around 
it. With just the right push, the little ball might even orbit the big one for a while. 
This trampoline model explains how a large body could, by curving space, influence 
a small one over great distance. 

Einstein stated that space-time is curved by matter and energy. In this curved 
space-time, even light rays are bent as they pass near massive objects like our sun. 
Thanks to Gauss and Riemann, the curvature of space-time requires no external 
"universe" in which it curves. 

The equations that tell one how much space and time are curved by matter and 
energy are known as Einstein 5 field equations. A description of them is beyond the 
scope of this book, but the mathematical kernel from which these equations arise is 
not; this kernel is based on another remarkable result of the research of Gauss and 
Bonnet. 


Gauss-Bonnet Theorem 


In Example 2, we computed the Gauss curvature K of the sphere x? + y? +z? = R? 
of radius R and found it to be the constant 1/R?. The Gauss curvature K is a scalar- 
valued function over the surface, and as such we can integrate it over the surface. We 
wish to consider a constant times this surface integral, namely, 


1 
I K dA. 
27 I ad 


For the sphere of radius R, this quantity becomes 


1 4n R? 
= = 2: 
2x R? JI. aa 2x R? 
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What Gauss and Bonnet discovered was that if S is any “sphere-like” closed surface 
(closed and bounded, but with no boundary, as in Figure 7.7.8), then 


1 
= K dA=2 
x: JI 


still holds. 


Figure 7.7.8 A deformed sphere. 1/27 S s KdA-2. 


Thus, the integral 


1 
z ff «^ 


always equals the integer 2, and is therefore a topological invariant of the surface. 
That the integral of curvature should be an interesting quantity should be already 
clear from the discussion at the end of Section 7.1. 

Now consider a torus, or doughnut. The torus can be considered as coming from 
the sphere by cutting out two disks and gluing in a handle (see Figure 7.7.9). 

Moreover, we can continue this process adding 1, 2, 3, ..., g handles to the 
sphere. If g handles are attached, we call the resulting surface a surface of genus g, 
as in Figure 7.7.10. Notice that the torus has genus 1. 

If two surfaces have a different genus, they are topologically distinct, and thus 
cannot be obtained from one another by bending or stretching. Interestingly, even two 
surfaces with the same genus can sit in space in quite different and complex ways, 
as in Figure 7.7.11. Astonishingly, even though the integral (or total curvature) given 
by (1/27) fff. s K dA depends on the genus, it does not depend on how the surface sits 
in space (and thus not on K). 


"S Roughly speaking, this means that S can be obtained from the sphere by bending and stretching (like with a balloon) 
but not tearing (the balloon bursts!) 
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Handle 


Torus 


Figure 7.7.9 Gluing a handle to a sphere to obtain a torus. 


Figure 7.7.10 A sphere 
with 0, 1, 2, 3 handles 
attached. 


Gauss and Bonnet proved that 


1 
I K dA 22 —2g. 
x ll g 


Thus, for the sphere (g = 0), it is always 2 (already verified); for the torus, it is always 
0 (see Exercise 8). 

There is something even more remarkable connected to the theorem of Gauss— 
Bonnet, observed by the great German mathematician David Hilbert (Figure 
7.7.2). 
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Simple double doughnut 


Baker's pretzel 


Figure 7.7.11 Two manifestations of a surface S in IR? of genus 2. 


Figure 7.7.12 David Hilbert (1862-1943) was a leading 
mathematician of his time. 


Hilbert observed that the Gauss-Bonnet theorem is, in effect, a two-dimensional 
version of Einstein's field equations. In the physics literature, this fact is known as 
Hilbert s action principle in general relativity.'? Not surprisingly, similar geometric 
ideas are being employed by contemporary researchers in an effort to unify gravity 
and quantum mechanics—to “quantize” gravity, so to speak. 


'See C. Misner, K. Thorne, and A. Wheeler, Gravitation, Freeman, New York, 1972. 
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EXERCISES 
1. The helicoid can be described by 


@(u, v) = (u cos v, u sin v, bv), where b # 0. 


Show that H = 0 and that K = —b?/(b? + u?y". In Figures 7.7.1 and 7.7.5, we see that the 
helicoid is actually a soap film surface. Surfaces in which H = 0 are called minimal surfaces. 


2. Consider the saddle surface z — xy. Show that 


-1 
K = —— 
( x yn 


and that 


—xy 
H= ua 
(+2? + yj? 


3. Show that ®(u, v) = (u, v, log cos v — log cos u) has mean curvature zero (and is thus a 
minimal surface; see Exercise 1). 


4. Find the Gauss curvature of the elliptic paraboloid 


7. Show that Enneper’s surface 
3 3 
$(u, v) = (u- F +u, v— Tiun) 


is a minimal surface (H = 0). 


8. Consider the torus 7 given in Exercise 4, Section 7.4. Compute its Gauss curvature and 
verify the theorem of Gauss-Bonnet. [Hint: Show that || Tọ x Tl]? = (R + cos 9)? and 
K = cos Q/(R + cos ¢).] 


9. Lét (u, v) = (u, h(u) cos v, h(u)sin v), h > 0, be a surface of revolution. Show that 
K = —h"[h(Y + (hy yl. 
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10. A parametrization ® of a surface S is said to be conformal (see Section 7.4), provided 
that E = G, F = 0. Assume that & conformally parametrizes S.?? Show that if H and K 
vanish identically, then S must be part of a plane in R?. 


REVIEW EXERCISES FOR CHAPTER 7 
1. Integrate f(x, y, z) = xyz along the following paths: 


(a) e(t) = (e' cost, e' sint, 3),0 < t < 2x 
(b) e(t) = (cost, sint, t),0 < t < 2x 

(c) e(t) = 37i -2Pj (Kk, 0 £61 

(d) c(t) = ti + (1//2)?j + 39k, 0 € (£1 


2. Compute the integral of f along the path c in each of the following cases: 


(a) f(x,y,z) 2 x +y + yzie(t) = (sint, cost, t), 0 < t € 2x 
(b) f(x, y, z) = x + cos? z; e(t) = (sint, cost, t), 0 < t € 2x 
©) fG yz) =x +y +z elt) = (t, P, $0), 0<t<1 


3. Compute each of the following line integrals: 


(a) (sin zx)dy — (cos zy) dz, where C is the triangle whose vertices are (1, 0, 0), 
(0, 1, 0), and (0, 0, 1), in that order 

(b) fj. (sinz) dx + (cosz) dy — (xy)!? dz, where C is the path c(0) = (cos? 6, sin’ 0, 0), 
0<6<7x/2 


4. If F(x) is orthogonal to c'(t) at each point on the curve x = c(t), what can you say about 
J.F-ds? 


5. Find the work done by the force F(x, y) = (x? — y?)i + 2xyj in moving a particle 
counterclockwise around the square with corners (0, 0), (a, 0), (a, a), (0, a), a > 0. 


6. A ring in the shape of the curve x? + y? = a? is formed of thin wire weighing |x| + |y| 
grams per unit length at (x, y). Find the mass of the ring. 
7. Find a parametrization for each of the following surfaces; 


(a) x? + y! +2? —4x — 6y = 12 
(b) 2x? + y? +27 -—8x = 1 
(c) 4x? + 9y? — 22? = 8 


8. Find the area of the surface defined by ®: (u, v) +> (x, y, z), where 


x —h(u, v) Zu- v, y — g(u,v) 2 u, z= f(u, v) =v; 


0<u<1,0<v <1. Sketch. 


20 Gauss proved that conformal parametrization of a surface always exists. The result of this exercise remains valid even 
if ® is not conformal, but the proof is more difficult. 
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9. Write a formula for the surface area of ®: (r, 0) +> (x, y, z), where 
x =rcosé, y = 2rsiné, z-—r 
0 <r <1,0 <6 < 2m. Describe the surface. 


10. Suppose z = f(x, y) and (0f/dx)? + (Af/dy) = c, c > 0. Show that the area of the 
graph of f lying over a region D in the xy plane is y1 + c times the area of D. 


11. Compute the integral of f(x, y, z) = x? + y? + z? over the surface in Review 
Exercise 8. 


12. Find ffs f dS in each of the following cases: 


(a) f(x, y, z) = x; S is the part of the plane x + y + z = 1 in the positive octant 
defined by x > 0,y>0,z>0 

(b) f(x, y, z) = x^; S is the part of the plane x = z inside the cylinder x? + y? = 1 

(c) f(x, y, z) = x; S is the part of the cylinder x? + y? = 2x with 0 < z < yx? + y? 


13. Compute the integral of f(x, y, z) = xyz over the rectangle with vertices (1, 0, 1), 
(2, 0, 0), (1, 1, 1), and (2, 1, 0). 


14. Compute the integral of x + y over the surface of the unit sphere. 


15. Compute the surface integral of x over the triangle with vertices (1, 1, 1), (2, 1, 1), and 
(2, 0, 3). 


16. A paraboloid of revolution S is parametrized by ®(u, v) = (u cos v, u sin v, u?), 
0<u<2,0<v<2rz. 


(a) Find an equation in x, y, and z describing the surface. 
(b) What are the geometric meanings of the parameters u and v? 
(c) Find a unit vector orthogonal to the surface at ®(u, v). 
(d) Find the equation for the tangent plane at P(uo, vo) = (1, 1, 2) and express your 
answer in the following two ways: 
(i) parametrized by u and v; and 
(ii) in terms of x, y, and z. 


(e) Find the area of S. 


17. Let f(x, y, z) = xe’ coszzz. 


(a) Compute F = Vf. 
(b) Evaluate f, F - ds where c(t) = (3 cos? t, 5sin t,0),0<t < vr. 


18. Let F(x, y, z) = xi + yj + zk. Evaluate ff. F- dS where S is the upper hemisphere of 
the unit sphere x? + y? + z? = 1. 


19. Let F(x, y, z) = xi + yj + zk. Evaluate f. F- ds where e(t) = (e', t, t?),0<t < 1. 
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20. Let F = Vf for a given scalar function. Let c(t) be a closed curve, that is, e(b) = e(a). 
Show that f. F - ds = 0. 


21. Consider the surface (u, v) = (u? cos v, u? sin v, u). Compute the unit normal at 
u = 1, v = 0. Compute the equation of the tangent plane at this point. 


22. Let S be the part of the cone z? = x? + y? with z between 1 and 2 oriented by the normal 
pointing out of the cone. Compute ff; F - dS where F(x, y, z) = (x?, y?, z?). 


23. Let F = xi + x?j + yzk represent the velocity field of a fluid (velocity measured in 
meters per second). Compute how many cubic meters of fluid per second are crossing the xy 
plane through the square 0 <x < 1,0 y <1. 


24. Show that the surface area of the part of the sphere x? + y? + z? = 1 lying above the 
rectangle [—a, a] x [—a, a], where 2a? < 1, in the xy plane is 


25. Let S be a surface and C a closed curve bounding S. Verify the equality 


ffo x F)-d8 = [Feds 


if F is a gradient field (use Review Exercise 20). 


26. Calculate ff. F - dS where F(x, y, z) = (x, y, —y) and S is the cylindrical surface 
defined by x? + y? = 1,0 < z < 1, with normal pointing out of the cylinder. 


27. Let S be the portion of the cylinder x? + y? = 4 between the planes z = 0 and 
z = x + 3. Compute the following: 


(a) fsx? dS (b) ff. y^ dS (c) ff; z dS 
28. Let I be the curve of intersection of the plane z = ax + by with the cylinder 
x? + y? = 1. Find all values of the real numbers a and b such that a? + b? = 1 
and 


e *€-92 - 4 -o. 
r 


29. A circular helix that lies on the cylinder x? + y? = R? with pitch p may be described 
parametrically by 


x = Rcos6, y = Rsin6, z — p6, 020. 
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A particle slides under the action of gravity (which acts parallel to the z axis) without friction 
along the helix. If the particle starts out at the height zo > 0, then when it reaches the height 
z, 0 < z < zo, along the helix, its speed is given by 


ds = y (Zo — z)2g, 


dt 


where s is arc length along the helix, g is the constant of gravity, and t is time. 


(a) Find the length of the part of the helix between the planes z = zp and z = zi, 
0<2z < zo. 
(b) Compute the time 7, it takes the particle to reach the plane z = 0. 


E 


The Integral Theorems 
of Vector Analysis 


All the theory of the motion of fluids has just been reduced to the solution 
of analytic formulas. 


Leonhard Euler 


Fluids are a lot easier to drink than they are to understand. 


Alan Newell 


W eare now prepared to tie together vector differential calculus and vector integral 
calculus. This will be done by means of the important theorems of Green, 
Gauss, and Stokes. We shall also point out some of the physical applications of these 
theorems to the study of electricity and magnetism, hydrodynamics, heat conduction, 
and differential equations. 

The basic integral theorems in vector analysis had their origins in applications. For 
example, Green’s theorem, discovered about 1828, arose in connection with potential 
theory (this includes gravitational and electrical potentials). Gauss’ theorem—the 
divergence theorem—arose in connection with the study of capillarity (this theorem 
should be jointly credited to the Russian mathematician Ostrogradsky, who discovered 
the theorem around the same time as Gauss). Stokes’ theorem was first suggested in 
a letter to Stokes from the physicist Lord Kelvin in 1850 and was used by Stokes on 
the examination for the Smith Prize in 1854. 


8.1 Green’s Theorem 


Green’s theorem relates a line integral along a closed curve C in the plane R? to a 
double integral over the region enclosed by C. This important result will be generalized 
in the following sections to curves and surfaces in IR?. We shall be referring to line 
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integrals around curves that are the boundaries of elementary regions (see Section 5.3). 
To understand the ideas in this section, you may also need to refer to Section 7.2. 


Simple and Elementary Regions and their Boundaries 


A simple closed curve C that is the boundary of an elementary region has two 
orientations—counterclockwise (positive) and clockwise (negative). We denote C 
with the counterclockwise orientation as C*, and with the clockwise orientation as 
C- (Figure 8.1.1). 


ct c- 


Figure 8.1.1 (a) Positive orientation of C, and 
(b) negative orientation of C. 


Positive orientation Negative orientation 


(a) (b) 


The boundary C of a y-simple region can be decomposed into bottom and top 
portions, Cı and C5, and (if applicable) left and right vertical portions, Bı and B. 
Following Figure 8.1.2, we write, 


C=C +B +O +B, 


where the pluses denote the curves oriented in the direction of left to right or bottom to 
top, and the minuses denote the curves oriented from right to left or from top to bottom. 


To C*=C}HB}+C7+B7 ? Cte B+ Cy 


Figure 8.1.2 Two examples showing how to break the positively oriented boundary 
ofa y-simple region D into oriented components. 


We may make a similar decomposition of the boundary of an x-simple region 
into left and right portions, and upper and lower horizontal portions (if applicable) 
(Figure 8.1.3). 
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Ct = CH+ By + Cr+ Bi 
By 


x= 20) 


Figure 8.1.3 An example showing how to break the 
positively oriented boundary of an x-simple region D into 
oriented components. 


Similarly, the boundary of a simple region has two decompositions—one into 
upper and lower halves, the other into left and right halves. 


Green’s Theorem 


We shall now prove two lemmas in preparation for Green’s theorem. 


LEMMA 1 Let D be a y-simple region and let C be its boundary. Suppose 
P: D > Ris of class C!. Then 


P 
[ Pax =— |f Z axa. 
c+ D oy 


(The left-hand side denotes the line integral doe P dx + Qdy where Q = 0.) 
PROOF Suppose D is described by 
asx<b_— p(x) <y <h). 


We decompose C+ by writing Ct = C} + BY + Cy + By (see Figure 8.1.2). By 
Fubini’s theorem, we may evaluate the double integral as an iterated integral and then 
use the fundamental theorem of calculus: 


aP b [^p 
—(x, y) dx d: i f —(x, y)dy dx 
If ay © y)dx dy A M 3y y)dy 


b 
= f [PG do) — P(x, f G)] dx. 
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However, because Cf can be parametrized by x +> (x, $i(x)), a € x < b, and C} 
can be parametrized by x > (x, $2(x)), a € x < b, we have 


b 
[ Pæn dx= f Pos 
a ict 
and 
b 
[ P(x, da(x)) dx =f P(x, y)dx. 
E ct 
Thus, by reversing orientations, 
b 
-f P(x, ġa(x))dx =Í P(x, y) dx. 
a e 


Hence, 


oP 
If dx dy = 1 P dx f P dx. 
D Oy ct C 


Because x is constant on B7 and By, we have 


f ra=0=f Pdx, 
B} By 


2 1 


so 


f Pdx= | Pac f Pax+ [ Pax+ [ pax f Pax+ [ P dx. 
ct ct B} [ory By ct [os 
ll OF idp [ Pdx [ Pax =— [ Pdx. m 
D oy 6E Cr ct 


2 


Thus, 


We now prove the analogous lemma with the roles of x and y interchanged. 


LEMMA 2 Let D be an x-simple region with boundary C. Then if Q: D — IR 


is Cl, 
ody= [[ 90 aed), 
ct D Ox 


The negative sign does not occur here, because reversing the role of x and y corre- 
sponds to a change of orientation for the plane. 
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PROOF Suppose D is given by 


Wily) <x < Wy), czyzd. 


Using the notation of Figure 8.1.3, and noting that y is constant on B} and B; , we 


have 
[ ow= ody=[ a+ [ ody, 
ct Ci BI ERCI B; cy Cy 


where C7 is the curve parametrized by y +> (Y2(y), y), c € y < d, and CË is the 
curve y > (Wi(y), y), c € y x d. Applying Fubini's theorem and the fundamental 
theorem of calculus, we obtain 


nr "n DNE. uo [ "ewon — onok 


1(y) 


1 


Adding the results of Lemmas 1 and 2 proves the following important theorem. 


THEOREM 1: Green’s Theorem Let D be a simple region and let C 
be its boundary. Suppose P: D — R and Q: D — R are of class C!. Then 


Í. Pdx+Qdy= IL GE - ay) d 


The correct (positive) orientation for the boundary curves of region D can be 
remembered by the following device: Zf you walk along the curve C with the correct 
orientation, the region D will be on your left (see Figure 8.1.4). 


Figure 8.1.4 The correct 
orientation for the boundary of a 
region D. 
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Generalizing Green's Theorem 


Green's theorem actually applies to any “decent” region in R°. For instance, Green's 
theorem applies to regions that are not simple, but that can be broken up into pieces, 
each of which is simple. An example is shown in Figure 8.1.5. The region D is 
an annulus; its boundary consists of two curves C = C; + C2 with the indicated 
orientations. (Note that forthe inner region the correct orientation to ensure the validity 
of Green's theorem is clockwise; the device in Figure 8.1.4 still works for remembering 
the orientation!) If Theorem 1 is applied to each ofthe regions Di, D2, D3, and D4 and 
the results are summed, the equality of Green's theorem will be obtained for D and its 
boundary curve C. This works because the integrals along the interior lines in opposite 
directions cancel. This trick, in fact, shows that Green's theorem holds for virtually all 
regions with reasonable boundaries that one is likely to encounter (see Exercise 8). 


G Figure 8.1.5 Green’s theorem applies to 


D = D; U D: U D; U Dy. 


D-DjU D3U DU Dy 


Let us use the notation 3D for the oriented curve Ct, that is, the boundary curve 
of D oriented in the sense as described by the device in Figure 8.1.4. Then we may 
write Green's theorem as 


[oes oo- [f C - an 


Green's theorem is very useful because it relates a line integral around the bound- 
ary of a region to an area integral over the interior of the region, and in many cases 
it is easier to evaluate the line integral than the area integral or vice versa. For exam- 
ple, if we know that P vanishes on the boundary, we can immediately conclude that 
[fj (8 P /3y) dx dy = 0 even though 8.P/8y need not vanish on the interior. (Can you 
construct such a P on the unit square?) 


DWI Verify Green's theorem for P(x, y) = x and Q(x, y) = xy where 
D is the unit disk x? + y? « 1. 


SOLUTION Wedo this by evaluating both sides in Green's theorem directly. The 
boundary of D is the unit circle parametrized by x = cost, y = sint, 0 < t < 27, 
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and so 
Qn 
f Pdx + Qdy = [(cos t)(—sin t) + cos ¢ sint cos t] dt 
aD 0 
E T [ cos? N 
= t|- =0. 
2 Jo 3 Jo 

On the other hand, 


fl (i2- w= fe 


which is also zero by symmetry. Thus, Green's theorem is verified in this case. A 


We can use Green's theorem to obtain a formula for the area of a region bounded by 
a simple closed curve. 


THEOREM 2: Area of a Region If C is a simple closed curve that 
bounds a region to which Green's theorem applies, then the area of the region D 
bounded by C — 8D is 


1 
425] xdy —ydx. 
2 Jap 


PROOF Let P(x, y) = —y, Q(x, y) = x; then by Green's theorem we have 


sf, xdy - ydr =- iff, E aC 22 jaxa 
aD 
=5 [ftv + navay= ff avay =a. u 


DPI B94 Leta > 0. Compute the area (see Figure 8.1.6) of the region en- 
closed by the hypocycloid defined by x?/3 + y?/3 = a?” using the parametrization 


x =acos*6, y — asin? 6, 0x0 x2z. 
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Figure 8.1.6 The hypocycloid 
x —acos!0, y 2 asin 0,0 <6 < 2z. 


SOLUTION From the preceding box, and using the trigonometric identities 
cos? 0 + sin? 0 = 1, sin20 = 2 sinO cos6, and sin? $ = (1 — cos2¢)/2, we get 


4-5] xdy—ydx 
2 Jap 


1 2x 
E [(a cos? 6)(3a sin? 0 cos 0) — (a sin? @)(—3a cos? 0 sin 0)] dO 
0 


3 2n 3 2m 
= P f (sin? 0 cos^ 0 + cos? 0 sint 0) d0 = P [ sin? 0 cos? 0 d0 
0 0 


Sto f as ze [^ (Ee 
== 20 d0 = ~ ——— ) ae 
sa f sin 6 g^ A 2 


2n 3 2x 3 
= zef d6 — i | cos40d0 = >na. A 
0 16 Jo 8 


Vector Form using the Curl 


The statement of Green's theorem can be neatly rewritten in the language of vector 
fields. As we will see, this points the way to one possible generalization to R°. 


THEOREM 3: Vector Form of Green's Theorem Let D C R? be 
a region to which Green's theorem applies, let 3D be its (positively oriented) 
boundary, and let F = Pi + Qj be a C! vector field on D. Then 


[ras [ [mm aa f [o maa 


(see Figure 8.1.7). 
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—————————————— V. 
Figure 8.1.7 The vector form of Green's 
aD theorem. 


F(x, y) 


This result follows from Theorem 1 and the fact that (V x F) -k = 0Q/dx — 0P/dy. 
We ask the reader to supply the details in Exercise 14. 


DENSI O96] LetF = (xy?, y + x). Integrate (V x F) - k over the region in the 
first quadrant bounded by the curves y = x? and y = x. 


SOLUTION Method 1. We first compute the curl 


dF, OF, 
VxF=(0,0, 
ü ( ax ay 


) = (1 —2xy)k. 


Thus, (V x F)-k = 1 — 2xy. This can be integrated over the given region D (see 
Figure 8.1.8) using an iterated integral as follows: 


f[ means f a nivis - [o ts 


1 1 1 
= Dog! ax e a Ii ML 
fe x'—x*4x]dx 2^4 3* 


Figure 8.1.8 The region bounded by the curves 
y — x? and y =x. 


Method 2. Here we use Theorem 3 to obtain 


n xE)kdsay =f Peas. 
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The line integral of F along the curve y = x from left to right is 
1 1 1 5 
f Fidx+ Pdy= | (x3 + 2x)dx 2 2-12 —. 
Along the curve y — x? we get 


! ! iom... 
[ ^s 8o-[ xdx(xc4xQxdx)224l-42-- 
o o 6° BR 8 


Thus, remembering that the integral along y = x is to be taken from right to left, as 
in Figure 8.1.8, 


4 5 1 
Flia m ui 
jm s= 4 B 


Vector Form Using the Divergence 


There is another form of Green's theorem that can be generalized to R3. 


THEOREM 4: Divergence Theorem in the Plane Let D C R? bea 
region to which Green's theorem applies and let 3D be its boundary. Let n denote 
the outward unit normal to aD. If c: [a, b] > R2, t + e(t) = (x(t), y(t)) is a 
positively oriented parametrization of dD, n is given by 
ac OO) 
VOP + D'(OP 


see Figure 8.1.9). Let F = Pi + Qj bea C! vector field on D. Then 
gui J 


f F-nds= ff div F dA. 
aD D 


Figure 8.1.9 n is the outward unit normal to 3D. 
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PROOF Recall that c'(t) = (x'(r), y'(r)) is tangent to 3 D, and note that n - ¢' = 0. 
Thus, n is normal to the boundary. The sign of n is chosen to make it correspond to 
the outward (rather than the inward) direction. By the definition of the line integral 
(see Section 7.2), 


" P(x(t), YOY’) — OOM), YOO OE DOr 
[re - [ aan OF  D'OY at 


b 
= f LPEE, YOY) — OOM), y()x (1 at 


=f P dy — Q dx. 
aD 


By Green’s theorem, this equals 


If, (+2) acay= If, divFdA. m 


DEIJD] LetF = y'i + x?j. Compute the integral ofthe normal component 
of F around the unit square. 


SOLUTION This can be done using the divergence theorem. Indeed, 


fi F-nds = [f div FdA. 
aD D 


But div F = 0, and so the integral is zero. A 


EXERCISES 
1. Evaluate f, y dx — x dy where C is the boundary of the square [—1, 1] x [—1, 1] 
oriented in the counterclockwise direction, using Green's theorem. 


2. Find the area of the disk D of radius R using Green's theorem. 


3. Verify Green's theorem for the disk D with center (0, 0) and radius R and the functions: 


(a) P(x, y) =xy’, Qi, y) = — 
(b PG, y) 2x +y, Qœ, y) = 
(c) P(x, y) 2 xy = Q(x, y) 
(d) P(x, y) = 2y, Qi, y) =x 


4. Using the divergence theorem, show that h p F: nds = 0 where F(x, y) = yi — xj and 
D is the unit disk. Verify this directly. 


5. Find the area bounded by one arc of the cycloid x = a(0 — sin 0), y = a(1 — cos), 
where a > 0, and 0 < 0 < 27, and the x axis (use Green's theorem). 
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6. Under the conditions of Green's theorem, prove that 


(a) f Pods * roay f [lo - m) 


OP p22 dQ oP 
© Kf x rw) +(° wa) 


ot? 
=2 ff (rz axdy ax LI 


7. Evaluate the line integral 


f (x? — y) dx +0? + yay, 
c 


ao a0 
7 ajav 


where C is the unit circle, and verify Green’s theorem for this case. 


8. Prove the following generalization of Green’s theorem: Let D be a region in the xy plane 
with boundary a finite number of oriented simple closed curves. Suppose that by means of a 
finite number of line segments parallel to the coordinate axes, D can be decomposed into a 
finite number of simple regions D; with the boundary of each D; oriented counterclockwise 
(see Figure 8.1.5). Then if P and Q are of class C! on D, 


If E - Fe xdy =f Pdx + Qdy, 
aD 
where 0D is the oriented boundary of D. (Hint: Apply Green's theorem to each D;.) 
9. Verify Green's theorem for the integrand of Exercise 7 (that is, with P — 2x? — y? and 
Q = x? + y?) and the annular region D described by a < x? + y? < b, with boundaries 


oriented as in Figure 8.1.5. 


10. Let D be a region for which Green's theorem holds. Suppose f is harmonic; that is, 


2 2 
PS PI o 
ox? ay? 
on D. Prove that 
oF ax - Lay E 
D ay ^* 


11. (a) Verify the divergence theorem for F = xi -- yj and D the unit disk x? + y? < 1. 
(b) Evaluate the integral of the normal component of 2xyi — yj around the ellipse 
defined by x?/a? + y?/b? = 1. 


12. Let P(x, y) = —y/(x? + y?) and Q(x, y) = x/(x? + y?). Assuming D is the unit disk, 
investigate why Green's theorem fails for this P and Q. 


13. Use Green's theorem to evaluate fo, (y? + x?) dx + x‘ dy, where C* is the perimeter of 
the square [0, 1] x [0, 1] in the counterclockwise direction. 


14. Verify Theorem 3 by showing that (V x F)-k = 3Q/ðx — 9P/9y. 
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15. Use Theorem 2 to compute the area inside the ellipse x?/a? + y?/b? = 1. 
16. Use Theorem 2 to recover the formula A = i d r? d@ for a region in polar coordinates. 


17. Sketch the proof of Green's theorem for the region shown in Figure 8.1.10. 


Figure 8.1.10 Prove Green's theorem for this region. 


18. Prove the identity 


NL - [[ ev» 4 Vó- V$)dA. 


19. Use Green's theorem to find the area of one loop of the four-leafed rose r = 3 sin20. 
(Hint: x dy — y dx = r?d0). 


20. Show that if C is a simple closed curve that bounds a region to which Green's theorem 
applies, then the area of the region D bounded by C is 


4-[ xdys - [ ydx. 
aD aD 


Show how this implies Theorem 2. 

Exercises 21 to 29 illustrate the application of Green s theorem to partial differential 
equations. (Further applications are given in the Internet supplement.) They are particularly 
concerned with solutions to Laplace's equation, that is, with harmonic functions. For these 
exercises, let D be an open region in R? with boundary dD. Let u: DU ðD — R bea 


continuous function that is of class C? on D. Suppose p € D and the closed disks 
B, = B,(p) of radius p centered at p are contained in D for 0 < p < R. Define I(p) by 


I(p) = HI uds. 
D Jab, 


21. Show that limit, ,o (p) = 27u(p). 


22. Let n denote the outward unit normal to 0B, and du/dn = Vu -n. Show that 


[ Aa ff V'udA. 
ap, ON Bp 
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23. Using Exercise 22, show that /'(p) — A/A) ffs, V?^udA. 


24. Suppose u satisfies Laplace’s equation: V?u = 0 on D. Use the preceding exercises to 
show that 


1 
u(p) = mR Es uds. 


(This expresses the fact that the value of a harmonic function at a point is the average of its 
values on the circumference of any disk centered about it.) 


25. Use Exercise 24 to show that if u is harmonic (i.e., if Vu = 0), then u(p) can be 


expressed as an area integral 
u(p) — l f i] udA 
P= RR Br ` 


26. Suppose u is a harmonic function defined on D (i.e., V^u = 0 on D) and that u has a 
local maximum (or minimum) at a point p in D. 


(a) Show that u must be constant on some disk centered at p. (HINT: Use the results of 
Exercise 25.) 

(b) Suppose that D is path-connected [i.e., for any points p and q in D, there is a 
continuous path c: [0, 1] — D such that c(0) = p and c(1) = q] and that for some p the 
maximum or minimum at p is absolute; thus, u(q) < u(p) or u(q) > u(p) for every q in D. 
Show that u must be constant on D. 


(The result in this Exercise is called a strong maximum or minimum principle for 
harmonic functions. Compare this with Exercises 36 to 40 in Section 3.3.) 


27. A function is said to be subharmonic on D if V?u > 0 everywhere in D. It is said to be 
superharmonic if V?u <0. 


(a) Derive a strong maximum principle for subharmonic functions. 
(b) Derive a strong minimum principle for superharmonic functions. 


28. Suppose D is the disk {(x, y) | x? + y? < 1} and C is the circle {(x, y) | x? + y? = 1). 
In the Internet supplement, we shall show that if f is a continuous real-valued function on C, 
then there is a continuous function u on D U C that agrees with f on C and is harmonic on 
D. That is, f has a harmonic extension to the disk. Assuming this, show the following: 


(a) If q is a nonconstant continuous function on D U C that is subharmonic (but not 
harmonic) on D, then there is a continuous function u on D U C that is harmonic on D such 
that u agrees with q on C and q < u everywhere on D. 

(b) The same assertion holds if “subharmonic” is replaced by “‘superharmonic” and 
“q <u” by “gq >u? 


29. Let D be as in Exercise 28. Let f: D — IR be continuous. Show that a solution to the 
equation V?u = 0 satisfying u(x) = f(x) for all x € 3D is unique. 
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30. Use Green's theorem to prove the change of variables formula in the following special 


E ies- 


for a transformation (u, v) > (x(u, v), y(u, v)). 


a(x, y) 
Iu, v) v) dudv 


8.2 Stokes' Theorem 


Stokes’ theorem relates the line integral of a vector field around a simple closed curve 
C in R? to an integral over a surface S for which C is the boundary. In this regard it 
is very much like Green's theorem. 


Stokes' Theorem for Graphs 


Let us begin by recalling a few facts from Chapter 7. Consider a surface S that is the 
graph of a function f(x, y), so that S is parametrized by 


x=u 
yv 
z= f(u, v) 2 f(x. y) 


for (u, v) in some domain D in the plane. The integral of a vector function F over S 
was developed in Section 7.6 as 


ffs- ff lt x)*^( 2) +P Jaxa, a) 


where F = Fii + Foj + Fk. 

In Section 8.1, we first assumed that the regions D under consideration were 
simple; while this was used in our proof of Green’s theorem, we noted there that 
the theorem is valid for a wider class of regions. In this section we assume that D 
is a region whose boundary is a simple closed curve and to which Green’s theorem 
applies. Green’s theorem involves choosing an orientation on the boundary of D, as 
was explained in Section 8.1. The choice of orientation that validates Green’s theorem 
will be called positive. Recall that if D is simple, then the positive orientation is the 
counterclockwise one. 

Suppose that c: [a, b] — R?, e(t) = (x(t), y(t)) is a parametrization of ð D in the 
positive direction. Then we define the boundary curve 8S to be the oriented simple 
closed curve that is the image of the mapping p: t +> (x(t), y(t), f(x(t), y(t))) with 
the orientation induced by p (Figure 8.2.1). 

To remember this orientation (that is, the positive direction) on 0S, imagine that 
you are an “observer” walking along the boundary of the surface with the normal as 
your upright direction; you are moving in the positive direction if the surface is on 
your left. This orientation on aS is often called the orientation induced by an upward 
normal n. 
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Figure 8.2.1 The induced orientation on 9$: As you 
walk around the boundary, the surface should be on 
your left. 


THEOREM 5: Stokes Theorem for Graphs Let S be the 
oriented surface defined by a C? function z = f(x, y), where (x, y) e D, a 
region to which Green's theorem applies, and let F be a C! vector field on S. 
Then if 8S denotes the oriented boundary curve of S as just defined, we have 


[[ 29-45 f [v «a5 [ van 


Remember that f, as F + ds is the integral around 0S of the tangential component 
of F, while [fẹ G - dS is the integral over S of G - n, the normal component of G (see 
Sections 7.2 and 7.6). Thus, Stokes’ theorem says that the integral of the normal 
component of the curl of a vector field F over a surface S is equal to the integral of 
the tangential component of F around the boundary 88. 


PROOF If F= Fiji Fj + Fk, then 
OF; OF, OF, dF3\, dF, OF 
1F= i —— |k 
- E: eG a CE =) 
Therefore, we use formula (1) to write 
OF; OF) Oz 
1F-dS = 
f[osem- |p) Ce) 


OF, OF; Oz OF, OF, 
== 7 2 
dc 3l s) 5)]^ e 


534 


The Integral Theorems of Vector Analysis 


On the other hand, 


f Feds= [ F-as = [ he have has 
as P P 


where p: [a, b] > R?, p(t) = (x(t), y(t), f(x(t), y) is the orientation-preserving 
parametrization of the oriented simple closed curve 3S discussed earlier. Thus, 


dx dy dz 
[ras [ (8 «n2 «nt)a. 68) 


By the chain rule 


dz _ dada: , Ordy 
dt  üxdt  8ydt 


Substituting this expression into equation (3), we obtain 
b 
O0zY dx Oz dy 
F-ds= Fi Fi FF dt 
[r- [n r (5 5)] 
Oz az 
=f Fi + F — | dx +| P +F; — | dy (4) 
c ax oy 
az 
= Fi + F — )dx+ F, + P @ dy. 
aD ax b 


Applying Green's theorem to equation (4) yields (we are assuming that Green's the- 
orem applies to D) 


I [= +F30z/dy)  a(Fit+ Fs zen] dA 
Ox oy 


Now we use the chain rule, remembering that F;, F2, and F; are functions of x, y, 
and z and that z is a function of x and y, to obtain 


OF, 4 oF dz 0F;,8z AF; dz Az az 
+ + tA 
a T Oz Ox | Ox dy dz Ox dy ðxðy 
F F 2 
oF, Sm BO, S EON op E dA 
ðy az dy dy Ox az dy Ox ðyðx 


Because mixed partials are equal, the last two terms in each parenthesis cancel each 
other, and we can rearrange terms to obtain the integral of equation (2), which 
compietes the proof. m 


8.2 Stokes' Theorem 535 


DOSES LetF = yeři + xe*j + xye*k. Show that the integral of F around 
an oriented simple closed curve C that is the boundary of a surface S is 0. (Assume 
S is the graph of a function, as in Theorem 5.) 

SOLUTION Indeed, fo F-ds = ff, (V x F)-dS, by Stokes’ theorem. But we 
compute 


i i k 
VxF= 9 2 2 =0, 
Ox Oy Oz 
ye? xe? xye* 


and so f, F - ds = 0. Alternatively, we can observe that F = V(xye*), so its integral 
around a closed curve is zero. A 


(NOS 14554) Use Stokes’ theorem to evaluate the line integral 
[ —y! dx +x’ dy — zi dz, 
[e 


where C is the intersection of the cylinder x? + y? = 1 and the plane x + y +z = 1, 
and the orientation on C corresponds to counterclockwise motion in the x y plane. 


SOLUTION The curve C bounds the surface S defined by the equation z = 
1— x — y = f(x, y) for (x, y) in the set D = {(x, y) | x? + y? < 1) (Figure 8.2.2). 
We set F = —3?i E x?j — z?k, which has curl V x F = (3x? + 3y°)k. Then, by 


Figure 8.2.2 The curve C is the intersection 
of the cylinder x? + y? = 1 and the plane 
x+ytz=1. 
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Stokes’ theorem, the line integral is equal to the surface integral 


[[vx»-«s. 


But V x F has only a k component. Thus, by formula (1) we have 


ffo x F)-dS = [[ee + 3y") dx dy. 


This integral can be evaluated by changing to polar coordinates. Doing this, we get 


1 p2x 1 
3 f[ easy =3 | f r-rdodr = 6n | Pme rS 
D o Jo 0 4 2 


Let us verify this result by directly evaluating the line integral 
| —ydx + x? dy — zi dz. 
g 
We can parametrize the curve 3 D by the equations 
x = cost, y — sint, a= 0; 0<t<2z. 
The curve C is therefore parametrized by the equations 
x — cost, y — sint, z = l —sint — cost, 0<t<2rz. 
Thus, 
[ — y! dx +x’ dy — zd dz 
(o 


2x 
= [(—sin? r)(—sin £) + (cos? £)(cos t) 
0 


— (1 — sint — cost)}(—cost + sin t)] dt 
2n 2n 
= (cos* t + sin* t) dt (1 — sint — cos t) (—cos t + sint)dt. 
0 0 


The second integrand is of the form i? du, where u = 1 — sin t — cost, and thus the 
integral is equal to 


1 
ql — sint — cost)*]?" = 0. 
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Hence, we are left with 


Qn 
(cos? t + sin t) dt. 


This integral can be evaluated using formulas (18) and (19) of the table of inte- 
grals. We can also proceed as follows. Using the trigonometric identities 
Sc 1 — cos 2t e 1 + cos2t 
3 7 EE UN 


substituting and squaring these expressions, we reduce the preceding integral to 


1 2n 


1 2n 
= | Q-co?204t =n + zS cos? 2t dt. 
2 Jo 2 Jo 


Again using the identity cos? 2t = (1 + cos 4t)/2, we find 


1 f* 1 1 f* 
zi (1-ecosardt o e | acil cos 4t dt 
4 Jo 4 Jo 4 Jo 


n 3 
= = 0=—. 
zTtyt 


2 A 


Stokes' Theorem for Parametrized Surfaces 


To simplify the proof of Stokes' theorem given earlier, we assumed that the surface 
S could be described as the graph of a function z = f(x, y), (x, y) € D, where D 
is some region to which Green's theorem applies. However, without too much more 
effort we can obtain a more general theorem for oriented parametrized surfaces S. 
The main complication is in the definition of 0S. 

Suppose 4: D — R? isa parametrization ofa surface S and c(t) = (u(t), v(t)) is 
a parametrization of dD. We might be tempted to define 3S as the curve parametrized 
by t > p(t) = (u(t), v(t)). However, with this definition, 0S might not be the 
boundary of S in any reasonable geometric sense. 

For example, we would conclude that the boundary of the unit sphere S 
parametrized by spherical coordinates in IR? is half of the great circle on S lying 
in the xz plane, but clearly in a geometric sense S is a smooth surface (no points or 
cusps) with no boundary or edge at all (see Figure 8.2.3 and Exercise 20). Thus, this 
great circle is in some sense the *mistaken" boundary of S. 

We can get around this difficulty by assuming that ® is one-to-one on all of D. 
Then the image of dD under ®, namely, #(3 D), will be the geometric boundary of 
S = ®(D). If c(t) = (u(t), v(t)) is a parametrization of ðD in the positive direction, 
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Figure 8.2.3 The surface S is a portion of a sphere. 


*Mistaken" boundary of S 


we define 95 to be the oriented simple closed curve that is the image of the mapping 
p: (e (u(t), v(t)), with the orientation of 3S induced by p (see Figure 8.2.1). 


THEOREM 6: Stokes’ Theorem: Parametrized Surfaces Let S be 
an oriented surface defined by a one-to-one parametrization ®: D C R? > S, 
where D is a region to which Green's theorem applies. Let 0S denote the oriented 
boundary of S and let F be a C' vector field on S. Then 


[| x Fas - Feds. 


If S has no boundary, and this includes surfaces such as the sphere, then the 


integral on the left is zero (see Exercise 17). 


This is proved in the same way as Theorem 5. 


BASNI] Let S be the surface shown in Figure 8.2.4, with the indicated 
orientation. Let F = yi — xj + e**k. Evaluate Ws (V x F)-dS. 


SOLUTION This surface could be parametrized using spherical coordinates 
based at the center of the sphere. However, we need not explicitly find ® in or- 
der to solve this problem. By Theorem 6, [f. (V x F)-dS = f,. F- ds, and so if we 
parametrize 0S by x(t) = cost, y(t) = sint, 0 < t < 27, we determine 


Dx d dy 2n " 2 2r 
. = ELT uy aa = i — =-9 
IE ds [ 6 xa at [ (— sin* t — cos" t) dt f dt 7 


and therefore ff. (V x F)-dS=—2n. A 
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Figure 8.2.4 This surface S is a portion of a sphere sitting 
on top of the circle x? + y? = 1. It does not include the disk 
x? + y? < Lin the xy plane. 


'The Curl as Circulation per Unit Area 


Let us now use Stokes’ theorem to justify the physical interpretation of V x F interms 
of paddle wheels that was proposed in Chapter 4. Paraphrasing Theorem 6, we have 


[ [emnes = f [mas = f F-ds= Fr ds, 
S S as as 


where F7 is the tangential component of F. This says that the integral of the normal 
component of the curl of a vector field over an oriented surface S is equal to the line 
integral of F along ðS, which in turn is equal to the path integral of the tangential 
component of F over 0S. 

Suppose V represents the velocity vector field of a fluid. Consider a point P and a 
unit vector n. Let S, denote the disk of radius p and center P, which is perpendicular 
to n. By Stokes' theorem, 


If curl v-as = ff curl V-ndS = V-ds, 
Sp 8; 8S, 


where 3S, has the orientation induced by n (see Figure 8.2.5). 


Figure 8.2.5 A normal n induces an orientation on the 
boundary ðS, of the disk Sp. 


as, 
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By the mean-value theorem for integrals (Exercise 12, Section 7.6), there is a 
point Q in S, such that 


ll curl V: nd S = [curl V(Q)- n]A(S,) 
S, 


where A(S,) = 2p? is the area of Sp, curl V(Q) is the value of curl V at Q. Thus, 


1 1 
limit V-ds = limit If curl V)-dS 
526 AOD Jas, oe ad 41, "9 9 


= limit curl V(Q)- n = curl V(P)* n. 
p 
Thus;! 


curl V(P)- n = limit 
®) p>0 A(S,) Jas, 


V - ds. (5) 


Let us pause to consider the physical meaning of f, c V + ds when V is the velocity 
field of a fluid. Suppose, for example, that V points in the direction tangent to the 
oriented curve C (Figure 8.2.6). Then clearly fọ V - ds > 0, and particles on C tend to 
rotate counterclockwise. If V is pointing in the opposite direction, then f. cV: ds «0 
and particles tend to rotate clockwise. If V is perpendicular to C, then particles 
don't rotate on C at all and fo V -ds = 0. In general, fo V - ds, being the integral 
of the tangential component of V, represents the net amount of turning of the fluid 
in a counterclockwise direction around C. One therefore refers to fo V - ds as the 
circulation of V around C (see Figure 8.2.7). 


V v 


[V5 0 [v0 [va 0 


Figure 8.2.6 The intuitive meaning of the possible signs of f V - ds. 


! Some informal texts adopt equation (5) as the definition of the curl, and use it to “prove” Stokes’ theorem. However, this 
raises the danger of circular reasoning, for to show that equation (5) really defines a vector “curl V(P)" requires Stokes’ 
theorem, or some similar argument. 
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Motion 
of fluid 
particles 


M 
V, 
Motion | 
of fluid V2) 
particles V 
C 
y, z) 


/ 


(a) (b) 


Figure 8.2.7 Circulation of a vector field (velocity field of a fluid): (a) Circulation 
about C is zero; (b) nonzero circulation about C (“whirlpool”). 


These results allow us to see just what curl V means for the motion of a fluid. The 
circulation h s, V - ds is the net velocity of the fluid around 3 S,, so that (curl V)- n 
represents the turning or rotating effect of the fluid around the axis n. 


Circulation and Curl The dot product ofcurl V(P) with a unit vector n, namely, 
curl V(P) - n, equals the circulation of V per unit area at P on a surface perpen- 


dicular to n. 


Observe that the magnitude of curl V(P) - n is maximized when n = curl V/||curl V || 
(evaluated at P). Therefore, the rotating effect at P is greatest about the axis that is 
parallel to curl V/||curl V ||. Thus, curl V is aptly called the vorticity vector. 

We can use these ideas to compute the curl in cylindrical coordinates. 


Let the unit vectors e,, eo, e; associated to cylindrical coordinates 
be as shown in Figure 8.2.8. Let F = Fe, + Fseo + F;e;. (The subscripts here denote 
components of F, not partial derivatives.) Find a formula for the e. component of 
V x F in cylindrical coordinates. 


SOLUTION Let S be the surface shown in Figure 8.2.9. 
The area of S is r d0 dz and the unit normal is e,. The integral of F around the 
edges of S is approximately 
[Fo(r, 6, z) — Fo(r, 0, z + dz)]r d0 + [F,(r, 0 + d0, z) — F,(r, 0, z)| dz 


OF, 


ar, 
& — — dzr do + — d6 dz. 
az a0 
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Figure 8.2.8 Orthonormal vectors e,, eg, and e; associated 
with cylindrical coordinates. The vector e, is parallel to the 
line labeled r. 


Figure 8.2.9 A surface element in cylindrical coordinates. 


d6 


Thus, the circulation per unit area is this expression divided by r d0dz, namely, 


According to the previous box, this must be the e, component of the curl. A 


Gradient, Divergence, and Curl in Cylindrical and 
Spherical Coordinates 


By similar arguments to Example 4, one finds that the curl in cylindrical coordinates 
is given by 


e. reg € 


vxp-19 9 9| 
r|or 00 @z 
F, rF& F, 


We can find other important vector quantities expressed in different coordinate sys- 
tems. For example, the chain rule shows that the gradient in cylindrical coordinates is 
of lof of 


Via oe M. VAT, 
f= ane Pag utat 
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and in Section 8.4 we will establish related techniques that give the following formula 
for the divergence in cylindrical coordinates: 


1fa dF). a 
V-F=- F, Fe) |< 
r [že D+ ta; | 
Corresponding formulas for gradient, divergence, and curl in spherical coordinates are 


af. LEE 1 


Vf = 2 
f ap + 59g * psing 36 
1 1 OF 
V-F= 2p 
pap? P» gr" TI IL Fo) + pang 86 
and 
1 1 dF, 
VERE [= TEN cidem xe 


1 8F, 1 18 18F, 
i : F; / F, 
F sing 00 p pe »| SP k ap” o) p ð$ Je 


where ep, €j, eg are as shown in Figure 8.2.10 and where F = Fe, + Fyeg + Feo. 


Figure 8.2.10 Orthonormal vectors e,, eg, and eg associated with 
spherical coordinates. 


Faraday's Law 


Vector calculus plays an essential role in the theory of electromagnetism. The next 
example shows how Stokes' theorem applies. 


DENIA] Let E and H be time-dependent electric and magnetic fields, 
respectively, in space. Let S be a surface with boundary C. We define 


[ E-ds = voltage around C, 
c 


i ji H - dS = magnetic flux across S. 
s 
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Faraday's law (see Figure 8.2.11) states that the voltage around C equals the negative 
rate of change of magnetic flux through S. Show that Faraday's law follows from the 
following differential equation (one of the Maxwell equations): 


aisi. 
at 


Figure 8.2.11 Faraday’s law. 


SOLUTION Assume that —-dH/dt = V x E holds. By Stokes’ theorem, 


[e ffo «mas. 


Assuming that we can move 0/01 under the integral sign, we get 


-å [ [55 [[ 557 [fv mas [s 
[£575 ffias, 


and so 


which is Faraday'slaw. A 


Supplement to Section 8.2: Stokes' Theorem, Astronauts, 
and Falling Cats 


Falling Cats 


Have you ever wondered how a falling cat can right itself? Released from a resting 
position with its feet above its head, the cat is able to execute a 180? reorientation and 
land safely on its feet. This well-known phenomenon has fascinated people for many 
years—especially in cities like New York, where cats have been known to survive 
falls of 8 to 30 stories! 

There have been many incorrect explanations as to how cats are able to right 
themselves, including the idea that it has to do with how the cat twirls its tail. This 
cannot be right because Manx cats, which have no tails, can also perform this feat! 


d34331e€ 


Figure 8.2.12 
The falling cat 
rights itself by 
wriggling its 
body parts. 
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One observes, as in Figure 8.2.12, that the cat achieves this net change in ori- 
entation by wriggling, to create changes in its internal shape or configuration. On 
the surface, this provides a seeming contradiction; because the cat is dropped from 
a resting position, it has zero angular momentum at the beginning of the fall and 
hence, according to a basic law of physics called conservation of angular momentum, 
the cat has zero angular momentum throughout the duration of its fall? Amazingly, 
the cat has effectively changed its angular position while maintaining zero angular 
momentum! 

The exact process by which this occurs is subtle; intuitive reasoning can lead one 
astray and, as we have indicated, many false explanations have been offered throughout 
the history of trying to solve this mystery.? Recently, new and interesting insights have 
been discovered using geometric methods that, in fact, are related to curvature (see 
Section 7.7). 

The way that curvature and geometry are related to the falling cat phenomenon 
is not easy to explain in full detail, but we can explain a similar phenomenon that is 
easy to understand. One of the points to emphasize is that Stokes ' theorem is the key 
to proving all of the relevant theorems. 


Reorienting Astronauts 


Another example to help visualize this effect is to consider astronauts who wish to re- 
orient themselves in a free-space environment. As with the falling cat, this motion can 
again be achieved using internal gyrations, or shape changes. For instance, consider 
astronauts moving their arms much like the motion of arms stirring liquid in a large 
kettle. The arms are held out forward, to lie in a horizontal plane that goes through 
the shoulders, parallel to the floor; the hands are clasped together and remain in this 
horizontal plane during the circular stirring motion. At the point of maximum exten- 
sion of the arms, the inertia of the body about a vertical axis is also at a maximum. 
Conservation of angular momentum requires that the body rotate in an opposite and 
proportional manner to the motion of the arms. As the arms rotate around and are 
brought in, however, the inertia of the body is reduced. The motion of the body in 
reaction is therefore also reduced. Thus, in one complete cycle of arm movement, the 
body undergoes a net rotation opposite the direction of arm motion. When the desired 


*We saw an instance of the law of conservation of angular momentum in Section 4.1, Exercise 20. 


? Another favorite fallacious argument, showing that a cat cannot turn itself over(!), is this: "Accept from physics that 
angular momentum is the moment of inertia times angular velocity [moments of inertia are discussed in Section 6.3]. But 
the angular momentum of the cat is zero, so the angular velocity must also be zero. Because angular velocity is the rate 
of change of the angular position, the angular position is constant. Thus, the cat cannot turn itself over.” What is wrong? 
This argument ignores the fact that the cat changes its shape, and hence its moment of inertia, during the fall. 


*See T. R. Kane and M. Scher, "A Dynamical Explanation of the Falling Cat Phenomenon,” Int. J. Solids Struct., 5 
(1969): 663-670. See also R. Montgomery, “Isoholonomic Problems and Some Applications; Commun. Math. Phy: 
128 (1990): 565-592; R. Montgomery, “How Much Does a Rigid Body Rotate? A Berry's Phase from the 18" Centu: 
Am. J. Pys., 59 (1991): 394—398. See also J. E. Marsden and J. Ostrowski, “Symmetries in Motion: Geometric Foundations 
of Motion Control,” Nonlinear Science Today (1998), http://link.springer-ny.com; R. Batterman, “Falling Cats, Parallel 
Parking, and Polarized Light,” Philos. Soc. Arch. (2002); http://philsci-archive.pitt-edu/documents/disk0/00/00/05/83, 
http://www. its.caltech.edu/~mleok/falling-cats.htm, and references therein. 
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orientation is achieved, the astronaut needs merely to stop the arm motion in order to 
come to rest. One often refers to the extra motion that is achieved as geometric phase. 


Link with Non-Euclidean Geometry 


The theory of geometric phases also shows up in an interesting way in non-Euclidean 
geometry—as in the geometry of triangles drawn on a sphere. A simple way to 
explain this link is as follows. Hold your hand at arm’s length, but allow rotation in 
your shoulder joints. Move your hand along three great circles, forming a triangle 
on the sphere; during the motion along each arc, always keep your thumb parallel; 
that is, it should move in such a way that it forms a fixed angle with the direction of 
motion along each arc and does not rotate when switching arcs. After completing the 
circuit around the triangle, your thumb will return rotated through an angle relative to 
its starting position (see Figure 8.2.13). Can you see in Figure 8.2.13 that the angle of 
rotation is 90° (or 7/2 radians) and that this is what happens when you do the thumb 
experiment yourself? 


Figure 8.2.13 A parallel movement of your thumb around a 
spherical triangle produces a phase shift. 


For general spherical triangles, this angle (in radians) is given by © = A — 7, 
where A is the sum of the angles of the triangle. The fact that © is strictly positive 
(!) is one of the basic truths of non-Euclidean geometry—the sum of the angles of 
a right triangle on a sphere is greater than z! This angle is also related to the area 
A enclosed by the triangle through the relation © = A/r?, where r is the radius of 
the sphere. The rotational shift of the thumb during the course of its cyclic journey 
around the spherical triangle is directly related to the curvature of the sphere and to 
the area enclosed by the path that is traced out. Notice first that for a spherical triangle 
that is 1/8 of the sphere, A = 47r? /8 = zr?/2. Thus, A/r? = 2/2. Notice also that 
when r — oc, the sphere becomes flatter and thus approaches a Euclidean plane, in 
which case O — 0. 

The cyclic journey of the thumb around the closed path is analogous to the cyclic 
internal motions made by the cat during its fall; the 90° shift in the direction of 
the thumb after one trip around is analogous to the 180° reorientation of the cat. A 
deeper look at the underlying mathematics shows that, in fact, they are both instances 
of the same phenomenon (called Aolonomy)—and Stokes’ theorem is the key to 
understanding it. 
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EXERCISES 
1. Redo Exercise 5 of Section 7.6 using Stokes' theorem. 


2. Redo Exercise 6 of Section 7.6 using Stokes' theorem. 


3. Verify Stokes’ theorem for the upper hemisphere z = y1 — x? — y?, z > 0, and the 
radial vector field F(x, y, z) = xi + yj + zk. 


4. Let S bea surface with boundary 3S, and suppose E is an electric field that is 
perpendicular to 3S. Show that the induced magnetic flux across S is constant in time. (HINT: 
Use Faraday's law.) 


5. Let S be the capped cylindrical surface shown in Figure 8.2.14. S is the union of two 
surfaces, S, and S2, where S; is the set of (x, y, z) with x? + y? = 1,0 < z < 1, and S; is 
the set of (x, y, z) with x? + y? + (z — 1? = 1,z > 1. Set F(x, y, z) = (zx + zy - x) + 
(z3yx + y)j + z4x?k. Compute ff. (V x F)- dS. (Hint: Stokes’ theorem holds for this 
surface.) 


Figure 8.2.14 The capped cylinder is the union of S; 
and S5. 


6. Let c consist of straight lines joining (1, 0, 0), (0, 1, 0), and (0, 0, 1) and let S be the 
triangle with these vertices. Verify Stokes’ theorem directly with F = yzi + xzj + xyk. 


7. Evaluate the integral ff.(V x F)- dS, where S is the portion of the surface of a sphere 
defined by x? + y? +2? = 1 and x + y +z > 1, and where = r x (i +j + k), 
r=xi+yj+zk. 


8. Show that the calculation in Exercise 7 can be simplified by observing that 
Jas E- dr = f, F - dr for any other surface X. By picking X appropriately, ff. (V x F)-dS 
may be easy to compute. Show that this is the case if X is taken to be the portion of the plane 
x+y +z = l inside the circle 0S. 


9. Calculate the surface integral [f.(V x F)- dS where S is the hemisphere x?+y?°+2° = 1, 
x > Oand F = x?i — yj. 
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10. Find ff.(V x F)-dS, where S is the ellipsoid x? + y? + 27? = 10 and F is the vector 
field F = (sinxy)i + e*j — yzk. 


11. Let F = yi — xj + zx3y?k. Evaluate //,(V x F)-ndA, where S is the surface defined 
byx? +y? 4+ 2=1,2<0. 


12. A hot-air balloon has the truncated spherical shape shown in Figure 8.2.15. The hot 
gases escape through the porous envelope with a velocity vector field 


V(x,y,z) = V x (x, y, z) where S (x, y, z) = —yi + xj. 


If R — 5, compute the volume flow rate of the gases through the surface. 


Figure 8.2.15 A hot-air balloon. 


13. Prove that Faraday’s law implies V x E = —dH/dt. 


14. Let S be a surface and let F be perpendicular to the tangent to the boundary of S. Show 


that 
[ [or xas =o. 


What does this mean physically if F is an electric field? 


15. Consider two surfaces Sı, S2 with the same boundary 0S. Describe with sketches how S; 
and S, must be oriented to ensure that 


[[ mms ff mas 


16. Fora surface S and a fixed vector v, prove that 


a f [ vas f eas. 


where r(x, y, z) — (x, y, z). 
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17. Argue informally that if S is a closed surface, then 


[m «9-0 


(see Exercise 15). (A closed surface is one that forms the boundary of a region in space; thus, 
for example, a sphere is a closed surface.) 


18. If C is a closed curve that is the boundary of a surface S, and f and g are C? functions, 
show that 


(a) [ set [[oor vas 
S 


(b) f (Vg - gVf)-ds -0 


19. (a) If C isa closed curve that is the boundary of a surface S, and v is a constant vector, 


show that 
i v-ds=0. 
C 


(b) Show that this is true even if C is not the boundary of a surface S. 


20. Show that &: D > R?, D = [0, x] x [0, 277], (9, 0) = (cos @ sing, sin@ sing, cos ¢), 
which parametrizes the unit sphere, takes the boundary of D to half of a great circle on S. 


21. Verify Theorem 6 for the helicoid ®(r, 0) = (r cos 6, r sin 0, 0), (r, 0) € [0, 1] x 
[0, 2/2], and the vector field F(x, y, z) = (z, x, y). 


22. Prove Theorem 6. 


23. Let F = x?i + (2xy + x)j + zk. Let C be the circle x? + y? = 1 and S the disk 
x? + y? < 1 within the plane z = 0. 


(a) Determine the flux of F out of S. 
(b) Determine the circulation of F around C. 
(c) Find the flux of V x F. Verify Stokes’ theorem directly in this case. 


24. Let S be a surface with boundary 3S, and suppose that E is an electric field that is 
perpendicular to 9.5. Use Faraday’s law to show that the induced magnetic flux across S is 
constant in time. 


25. Integrate V x F, F = (3y, —xz, —yz?) over the portion of the surface 2z = x? + y? 
below the plane z — 2, both directly and by using Stokes' theorem. 


26. Ampére's law states that if the electric current density is described by a vector field J and 
the induced magnetic field is H, then the circulation of H around the boundary C of a surface 
S equals the integral of J over S (i.e., the total current crossing S). See Figure 8.2.16. Show 
that this is implied by the steady-state Maxwell equation V x H — J. 
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Figure 8.2.16 Ampére’s law. 


current / — flux 
ofJ 


27. Faraday’s law relates the line integral of the electric field around a loop C to the surface 
integral of the rate of change of the magnetic field over a surface S with boundary C. 
Regarding the equation V x E = —0H/9t as the basic equation, Faraday's law is a 
consequence of Stokes' theorem, as we have seen in Example 4. 

Suppose we are given electric and magnetic fields in space that satisfy V x E — 
—9H/8t. Suppose C is the boundary of the Möbius band shown in Figures 7.6.3 and 7.6.4. 
Because the Móbius band cannot be oriented, Stokes’ theorem does not apply. What becomes 
of Faraday's law? What do you guess fọ E- ds equals? 


28. (a) If in spherical coordinates, we write 
e, = ai + Bj + yk, find a, f, and y. 


(b) Find similar formulas for e, and eg. 


8.3 Conservative Fields 


We saw in Section 7.2 that for a gradient force field F = Vf, line integrals of F were 
evaluated as follows: 


f F- ds = f(c(b)) — f(c(a)). 


The value of the integral depends only on the endpoints c(5) and c(a) of the path. In 
other words, if we used another path with the same endpoints, we would still get the 
same answer. This leads us to say that the integral is path-independent. 

Gradient fields are important in many physical problems. For example, if V = — f 
represents a potential energy (gravitational, electrical, and so on), then F represents 
a force? Consider the example of a particle of mass m in the field of the earth; in 
this case, one takes f to be Gm M/r or V = —GmM/r, where G is the gravitational 
constant, M is the mass of the earth, and r is the distance from the center of the 


5If the minus sign is used, then V is decreasing in the direction F. 
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earth. The corresponding force is F = —(GmM/r?)r = —(GmM/r?)n, where n is 
the unit radial vector. Note that F fails to be defined at the point r — 0. 


When are Vector Fields Gradients? 


We wish to characterize those vector fields that can be written as a gradient. Our task 
is simplified considerably by Stokes' theorem. 


THEOREM 7: Conservative Fields Let F be a C! vector field defined 
on R? except possibly for a finite number of points. The following conditions on 
F are all equivalent: 


(i) For any oriented simple closed curve C, fọ F - ds = 0. 


(ii) For any two oriented simple curves C, and C3 that have the same endpoints, 


] raf F-ds. 
Ci C; 


(iii) F is the gradient of some function f; that is, F = V f (and if F has one or 
more exceptional points where it fails to be defined, f is also undefined 
there). 

(iv) Vx F=0. 


A vector field satisfying one (and, hence, all) of the conditions (i)—(iv) is 
called a conservative vector field.^ 


PROOF We shall establish the following chain of implications, which will prove 
the theorem: 


(i) = (ii) > (iii) > (iv) > (i). 


First we show that condition (i) implies condition (ii). Suppose c, and c? are 
parametrizations representing C, and C5, with the same endpoints. Construct the 
closed curve c obtained by first traversing e; and then —c» (Figure 8.3.1), or, symbol- 
ically, the curve c = e, — ¢2. Assuming c is simple, condition (i) gives 


[ra f F-ds- | F-ds=0, 
e € € 


and so condition (ii) holds. (If ¢ is not simple, an additional argument, omitted here, 
is needed.) 


In the plane R?, exceptional points are not allowed (see Exercise 12). Theorem 7 can be proved in the same way if F is 
defined and is of class C' only on an open convex set in R? or R?. (A set D is convex if P, Q € D implies the line joining 
P and Q also belongs to D.) 
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eb) = ¢2(b) 


Figure 8.3.1 Constructing (a) an 
€-6,- €; oriented simple closed curve c, — ¢2 from 
(b) two oriented simple curves. 


(a) 


(0, 0, 0) 


(x, 0, 0) 


(a) = Fa 


Next we prove that condition (ii) implies condition (iii). Let C be any oriented 
simple curve joining a point such as (0, 0, 0) to (x, y, z), and suppose C is represented 
by the parametrization c [if (0, 0, 0) is the exceptional point of F, we can choose 
a different starting point for e without affecting the argument]. Define f(x, y, z) to 
be f, F- ds. By hypothesis (ii), f(x, y, z) is independent of C. We shall show that 
F = grad f. Indeed, choose c to be the path shown in Figure 8.3.2, so that 


z 


x o 
foe. y, j= Fig, 0, oar f Fs, Oai + f F(x,y, t)dt, 
0 0 0 


where F = (Fi, Fo, F3). 


Q5 y. z) 


j Figure 8.3.2 A path joining (0, 0, 0) to (x, y, z). 


It follows from the fundamental theorem of calculus that 0f/dz = F3. We can 
repeat this processing using two other paths from (0, 0, 0) to (x, y, z) [for example, 
by drawing the lines from (0, 0, 0) to (0, y, 0) to (x, y, 0) to (x, y, z)], and we can 
similarly show that 0f/dx = F, and 8f/8y = F (see Exercise 22). Thus, Vf = F. 

Third, condition (iii) implies condition (iv), because, as proved in Section 4.4, 


V x Vf —0. 


Finally, let c represent a closed curve C and let S be any surface whose boundary 
is c (if F has exceptional points, choose S to avoid them). Figure 8.3.3 indicates that we 
can probably always find such a surface; however, a formal proof of this would require 
the development of more sophisticated mathematical ideas than we can present here. 


8.3 Conservative Fields 553 


By Stokes' theorem, 


[F-as= [rn f [i mas f [mn as. 


Because V x F = 0, this integral vanishes, so that condition (iv) = condition (i). m 


[e 


Figure 8.3.3 A surface S spanning a curve C. 


T di 


Physical Interpretations of fo F - ds 


We have already seen that one interpretation ofthe line integral is as the work done by 
F in moving a particle along C. A second interpretation is the notion of circulation, 
which we encountered at the end ofthe last section. Recall that in this case, we think of 
F as the velocity field ofa fluid; that is, to each point P in space, F assigns the velocity 
vector of the fluid at P (in the last section, F was designated V). Take C to be a closed 
curve, and let As be a small directed chord of C. Then F - As is approximately the 
tangential component of F times || As||. The circulation le F - ds is the net tangential 
component around C. A small paddle wheel placed in the fluid would rotate if it is 
centered at a point where F vanishes and if the circulation of the fluid is nonzero, or 
Je E- ds + 0 for small loops C (see Figure 8.3.4). 


Figure 8.3.4 fo F -ds # 0 implies that a 
paddle wheel in a fluid with velocity field F will 
rotate around its axis. 


There is a similar interpretation in electromagnetic theory: If F represents an 
electric field, then a current will flow around a loop C if fo F - ds # 0. 
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By Theorem 7, a field F has no circulation if and only if curl F = V x F = 0. 
Hence, a vector field F with curl F — 0 is called irrotational. We have therefore 
proved that a vector field in IR? is irrotational if and only if it is a gradient field for 
some function, that is, if and only if F — Vf. The function f is called a potential 
for F. 


955] Consider the vector field F on R? defined by 
F(x, y, z) = yi + (z cos yz + x)j + (y cos yz)k. 


Show that F is irrotational and find a scalar potential for F. 


SOLUTION We compute V x F: 


i j k 

a a a 

VxF-2|— — = 

x Ox oy Oz 
y x-zcosyz y cos yz 


= (cos yz — yz sin yz — cos yz + yz sin yz)i + (0 — 0)j + (1 — Dk 

= 0i+ 0j + 0k = 0, 
so F is irrotational. Thus, a potential exists by Theorem 7. We can find it in several 
ways. 


Method 1. By the technique used to prove that condition (ii) implies condition 
(iii) in Theorem 7, we can set 


x y z 
fe.».2- f F.0.9at + f Pæt Odet | Fy(x, y, t)dt 
0 0 0 


x y z 
af odis. [ xat+ f y cos yt dt 
0 0 0 


=04+xy+sinyz = xy + sin yz. 
One easily verifies that Vf = F, as required: 


V= Sf ig S 54 kaitza AE: 
Ox oy 0z 


Method 2. Because we know that f exists, we know that we can solve the 
system of equations 


a 9, a 
a anys ae ers nn 
Ox oy Oz 
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for f(x, y, z). These are equivalent to the simultaneous equations 
(a) f(x, y,z) =xy +hi(y, 2) 

(b) f(x, y,z) = sinyz + xy + ha(x, z) 

(c) f(x,y, z) = sinyz + hs(x, y) 


for functions A1, h2, h3 independent of x, y, and z (respectively). When Ai(y, z) = 
sin yz, h2(x, z) = 0, and h3(x, y) = xy, the three equations agree and so yield a 
potential for F. However, we have only guessed at the values of A, h2, and h3. To 
derive the formula for f more systematically, we note that because f(x, y,z) = 
xy +h\(y, z) and df/dz = y cos yz, we find that 


dahi(y, z) 


dz "085 


or 


hi(y,z) = IE cos yz dz + g(y) = sin yz + gy). 


Therefore, substituting this back into equation (a), we get 
fx, y, z) 2 xy * sinyz + g(y); 
but by equation (b), 


g(y) = h(x, z). 


Because the right side of this equation is a function of x and z and the left side is a 
function of y alone, we conclude that they must equal some constant C. Thus, 


fx, y,z) 2 xy sinyz +C 


and we have determined f up to a constant. 4 


A mass M atthe origin in IR? exerts a force on a mass m located at 

r = (x, y, z) with magnitude Gm M/r? and directed toward the origin. Here, G is the 
gravitational constant, which depends on the units of measurement, and r = ||r|| = 
x? + y? + 22. If we remember that —r/r is a unit vector directed toward the origin, 
then we can write the force field as 


GmMr 
pc 


F(x,y,z)—-— 
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Show that F is irrotational and find a scalar potential for F. (Notice that F is not 
defined at the origin, but Theorem 7 still applies, because it allows an exceptional 
point.) 
SOLUTION First let us verify that V x F = 0. Referring to formula 10 in the 
table of vector identities in Section 4.4, we get 
l 1 
VxF=-GmM|V| 5] xr+5V xr}. 
p F 
But V(1/r?) 2 —3r/r (see Exercise 30, Section 4.4), and so the first term vanishes, 
because r x r — 0. The second term vanishes, because 
i j k 
ð ə a a a 8. a a 
Unie | ei SuSE aa LE yup; 
ax dy dz dy az az Ox Ox Oy 
X. y z 

Hence, V x F = 0 (for r z 0). 

If we recall the formula V(r”) = nr"—?r (again, see Exercise 30, Section 4.4), 
then we can read off a scalar potential for F by inspection. We have F = —V V, where 
V(x, ¥, Zz) = —GmM [r is called the gravitational potential energy. 

[We observe in passing that by Theorem 3 of Section 7.2, the work done by F in 
moving a particle of mass m from a point P, to a point P2 is given by 

1 1 
V(P1) — V(P2) = GnM = " 
mor 
where r; is the radial distance of P, from the origin, with r2 similarly defined.] 4 
The Planar Case 


By the same proof, Theorem 7 is also true for C! vector fields F on R?. In this 
case, we require that F has no exceptional points; that is, F is smooth everywhere 
(see Exercise 12). Notice, however, that the conclusion might still hold even if there 
are exceptional points, an example being (xi + yj)/(x? + y)*/*. An example where 
the conclusion does not hold is (—yi + xj)/(x? + y?), as shown in Exercise 12. 


If F = Pi + Qj, then 
vxr2 (2 - 7i 
Ox oy 


Sometimes 0Q/dx — 9 P/8y is called the scalar curl of F. Therefore, the condition 
V x F = 0 reduces to 
dP  0Q 


ðy ax” 
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Thus, we have: 


COROLLARY 1 If F isa C! vector field on R? ofthe form Pi + Qj that satisfies 
aP/dy = 0 Q/0x, then F = Vf for some f on R?. 


We emphasize again that this corollary can be false if F fails to be of class C! at 


even a single point (an example is given in Exercise 12). In IR, however, as already 
noted, exceptions at single points are allowed (see Theorem 7). 


| CWZIESEE (a) Determine whether the vector field 
F — ei 4 ej 


is a gradient field. 
(b) Repeat part (a) for 


F = (2x cos y)i — (x? sin y)j. 


SOLUTION (a) Here P(x, y) = e” and Q(x, y) = e**”, and so we compute 


These are not equal, and so F cannot have a potential function. 
(b) In this case, we find 


M es 2x sin 2e 
ay — Y 


and so F has a potential function f. To compute f we solve the equations 


a 9 
i = 2x cosy, z - —x? sin y. 


Thus, f(x, y) = x? cos y + hi(y) and f(x, y) = x? cos y + h(x). If hy and hz are 


the same constant, then both equations are satisfied, and so f(x, y) = x? cosy is a 
potential for F. A 


DGIB] Lete: [1, 2] > IR? begivenbyx = e'7!, y = sin(zt/t). Compute 
the integral 


[rase [ 2xc0sy dx —2? siny dy, 
c c 


where F = (2x cos y)i — (x? sin y)j. 
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SOLUTION The endpoints are c(1)— (1,0) and c(2)— (e, 1). Because 
(2x cos y)/dy = 8(—x? sin y)/dx, F is irrotational and hence a gradient vector field 
(as we saw in Example 3). Thus, by Theorem 7, we can replace c by any piecewise 
C! curve having the same endpoints, in particular, by the polygonal path from (1, 0) 
to (e, 0) to (e, 1). Thus, the line integral must be equal to 


e 1 
[r-[ 2rcos 0d + f — e sint dt = (e? — 1) + e*(cos1 — 1) 
c 1 0 
=e’ cosl —1. 


Alternatively, using Theorem 3 of Section 7.2, we have 


[2xcosyax x? sinydy = [ Vf -ds = re) f(e(1)) = e? cos 1 — 1, 


[3 [3 


because f(x, y) = x? cos y is a potential function for F. Evidently, this technique is 
simpler than computing the integral directly. A 


We conclude this section with a theorem that is quite similar in spirit to Theorem 
7. Theorem 7 was motivated partly as a converse to the result that curl Vf = 0 for any 
C! function f: IR? — R—or, if curl F = 0, then F = Vf. We also know [formula 
(9) in the table of vector identities in Section 4.4] that div (curl G) — 0 for any C? 
vector field G. We may ask about the converse statement: If div F — 0, is F the curl 
of a vector field G? The following theorem answers this in the affirmative. 


THEOREM 8 If F is a C! vector field on all of R? with div F = 0, then there 
exists a C! vector field G with F = curl G. 


The proof is outlined in Exercise 16. We should warn the reader at this point that, 
unlike the F in Theorem 7, the vector field F in Theorem 8 is not allowed to have an 
exceptional point. For example, the gravitational force field F = —(Gm Mr/r?) has 
the property that div F = 0, and yet there is no G for which F = curl G (see Exercise 
25). Theorem 8 does not apply, because the gravitational force field F is not defined 
at 0 € R?. 


EXERCISES 
1. Show that any two potential functions for a vector field on IR? differ at most by a constant. 
2. (a) Let F(x. y) = (xy, y?) and let c be the path y = 2x? joining (0, 0) to (1, 2) in IR?. 
Evaluate f. F- ds. 
(b) Does the integral in part (a) depend on the path joining (0, 0) to (1, 2)? 


3. Let F(x. y. z) = (2xyz + sinx)i + x?zj +x? yk. Find a function f such that F = Vf. 


4. Evaluate f. F - ds, where c(f) = (cos? t, sin? t, 1), 0 < t < zr, and F is as in Exercise 3. 
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5. If f(x) is a smooth function of one variable, must F(x, y) = f(x)i+ f(y)j bea 
gradient? 


6. (a) Show that F = —r/|[r||° is the gradient of f(x, y, z) = 1/r. 
(b) What is the work done by the force F = —r/|Ir||? in moving a particle from a point 
ro € R? “to 00,” where r(x, y, z) = (x, y, z)? 


7. Let F(x, y, z) = xyi + yj + zk. Can there exist a function f such that F = Vf? 


8. Let F = Fiji + Fj + Fk and suppose each F, satisfies the homogeneity condition 
Fy(tx, ty, tz) = t Fix, y, 2), k= 1,2,'3: 
Suppose also V x F = 0. Prove that F = Vf, where 
2f(x. y, z) =xF\(x, y, z) + yFo(x, yo z) + zFs(x, y, z). 


(Hint: Use Review Exercise 23, Chapter 2.) 


9. Let F(x, y, z) = (e" sin y)i + (e* cos y)j + 27k. Evaluate the integral f, F - ds, where 
e(t) = (Vt, P, exp /f),0 <t < 1. 


10. Let a fluid have the velocity field F(x, y, z) = xyi + yzj + xzk. What is the circulation 
around the unit circle in the xy plane? Interpret your answer. 


11. The mass of the earth is approximately 6 x 10?" g and that of the sun is 330,000 times as 
much. The gravitational constant is 6.7 x 1078 cm?/s? - g. The distance of the earth from the 
sun is about 1.5 x 10! cm. Compute, approximately, the work necessary to increase the 
distance of the earth from the sun by 1 cm. 


12. (a) Show that f(x dy — y dx)/(x? + y?) = 2x, where C is the unit circle. 

(b) Conclude that the associated vector field [—y/(x? + y?)]i + [x/(x? + y”)]j is nota 
conservative field. 

(c) Show, however, that 0 P/3y = 9 Q/Ox. Does this contradict the corollary to 
Theorem 7? If not, why not? 


13. Determine which of the following vector fields F in the plane is the gradient of a scalar 
function f. If such an f exists, find it. 


(a) FQ, y) 2 xi yj 
(b) F(x, y) = xyi + xyj 
(c) F(x, y) = (x? + yi  2xyj 


14. Repeat Exercise 13 for the following vector fields: 
(a) F(x, y) = (cosxy — xy sin xy)i — (x? sinxy)j 


(b) F(x, y) = (x /x?y? + Di (y /x?y? + Dj 


(c) F(x, y) = (2x cos y + cos y)i — (x? sin y + x sin y)j 
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15. Show that the following vector fields are conservative. Calculate f F - ds for the given 
curve. 


(a) F = (xy? + 3x?y)i + (x + y)x?j; C is the curve consisting of line segments from 

(1, 1) to (0, 2) to (3, 0). 
2. 2y(x? +1 

d Fechas 2U LE 
Sl PHF 
O<t<i. 

(c) F = [cos (xy?) — xy? sin (xy”)]i — 2x?y sin (xy*)j; C is the curve (e', e'*'), 
m 2 


C is parametrized by x = (?? — 1, y = tê — t, 


16. Prove Theorem 8. [HiNT: Define G = Gii + G2j + G3k by 


z 


62 [ Fey 0di - f Fy(x, t, 0)dt 
0 0 


Galx, 9,2) = -f Fix, y, D dt 
0 


and G3(x, y, z) = 0.] 


17. Is each of the following vector fields the curl of some other vector field? If so, find the 
vector field. 
(a) F 2 xi 4- yj -zk 
(b) F = (x? + Di + (z — 2xy)j + yk 
18. Let F = xzi — yzj + yk. Verify that V - F = 0. Find a G such that F = V x G. 
19. Repeat Exercise 18 for F = y’i -- z?j + x?^k. 
20. Let F = xe"i — (x cosz)j — ze"k. Find a G such that F = V x G. 


21. Let F = (x cos y)i — (sin y)j + (sin x)k. Find a G such that F = V x G. 


22. By using different paths from (0, 0, 0) to (x, y, z), show that the function f defined in 
the proof of Theorem 7 for “condition (ii) implies condition (iii)" satisfies 0f/dx = F, and 
af/dy = Fy. 


23. Let F be the vector field on IR? given by F = —yi + xj. 


(a) Show that F is rotational, that is, F is not irrotational. 

(b) Suppose F represents the velocity vector field of a fluid. Show that if we place a cork 
in this fluid it will revolve in a plane parallel to the xy plane, in a circular trajectory about the 
z axis. 

(c) In what direction does the cork revolve? 


24. Let G be the vector field on IR? {z axis} defined by 
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(a) Show that G is irrotational. 

(b) Show that the result of Exercise 23(b) holds for G also. 

(c) How can we resolve the fact that the trajectories of F and G are both the same 
(circular about the z axis) yet F is rotational and G is not? (Hint: The property of being 
rotational is a local condition, that is, a property of the fluid in the neighborhood of a point.) 


25. Let F = —(GmMr/r?) be the gravitational force field defined on IR? (0). 


(a) Show that div F — 0. 
(b) Show that F # curl G for any C! vector field G on R3\{0}. 


8.4 Gauss' Theorem 


Gauss' theorem states that the flux of a vector field out of a closed surface equals 
the integral of the divergence of that vector field over the volume enclosed by the 
surface. The result parallels Stokes’ theorem and Green's theorem in that it relates 
an integral over a closed geometrical object (curve or surface) to an integral over a 
contained region (surface or volume). 


Elementary Regions and their Boundaries 


We shall begin by asking the reader to review the various elementary regions in 
space that were introduced when we considered the volume integral; these regions 
are illustrated in Figures 5.5.2 and 5.5.4. As these figures indicate, the boundary of an 
elementary region in IR? is a surface made up of a finite number (at most six, at least 
two) of surfaces that can be described as graphs of functions from IR? to IR. This kind 
of surface is called a closed surface. The surfaces Sj, S2, ..., Sy composing such a 
closed surface are called its faces. 


EXAMPI The cube in Figure 8.4.1(a) is an elementary region, and in fact 
a symmetric elementary region, with six rectangles composing its boundary. The 
sphere in Figure 8.4.1(b) is the boundary of a solid ball, which is also a symmetric 
elementary region. A 


Figure 8.4.1 (a) Symmetric elementary regions 
and (b) the surface S; composing their 
boundaries. 
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Closed surfaces can be oriented in two ways. The outward orientation makes the 
normal point outward into space, and the inward orientation makes the normal point 
into the bounded region (Figure 8.4.2). 


Outward normal 


Figure 8.4.2 Two possible orientations for a closed surface. 


Suppose S is a closed surface oriented in one of these two ways and F is a vector 
field on S. Then, as we defined it in Section 7.6, 


f[r-5- X ff vas 


If S is given the outward orientation, the integral ff F - dS measures the total flux of F 
outward across S. That is, if we think of F as the velocity field ofa fluid, J ff. , F - dS indi- 
cates the amount of fluid leaving the region bounded by S per unit time. If S is given the 
inward orientation, the integral Ff. s F - dS measures the total flux of F inward across S. 
We recall another common way of writing these surface integrals, a way that 
explicitly specifies the orientation of S. Let the orientation of S be given by a unit 
normal vector n(x, y, z) at each point of S. Then we have the oriented integral 


[ [7-457 [femas 


that is, the integral of the normal component of F over 5. In the remainder of this 
section, if S is a closed surface enclosing a region W, we adopt the default convention 
that S = dW is given the outward orientation, with outward unit normal n(x, y, z) 
at each point (x, y, z) € S. Furthermore, we denote the surface with the opposite 
(inward) orientation by 3 Wap. Then the associated unit normal direction for this 
orientation is —n. Thus, 


ff 75 f [m mas - [[w-omas- - ff vas. 


DEAA The unit cube W given by 


O<x<l, 0<y<l, 0<z<1 


is a symmetric elementary region in space (see Figures 8.4.3 and 5.5.5). 
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ng= j Figure 8.4.3 The outward orientation on 
the cube. 


nj--k 


We write the faces as 


S::z-—0, 0x <1, 0xyzl 
S&:z=1, Ox sl, O0<y<l 
$3: x — 0, 0<y<l, 0xzzl 
S$;x-l O<y<l, 0<z<1 
Ssy=0, O<x<l, O<z<!1 
Ssy=1, Osx<l, O<z<l 


From Figure 8.4.3 we see that 


m -k--n, 
n; =i = —m, 
ng = j = —Ns, 


and so for a continuous vector field F = Fi + Fj + Fk, 


If F-as= ff ¥-nas=- ff ras+ [f ras- ff Fi dS 
aw s 5 Sr 5 
se J ras- ff ras ff FodS. A 
S4 Ss So 
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Gauss' Theorem 


We have now come to the last of the three central theorems of this chapter. This 
theorem relates surface integrals to volume integrals; in other words, the theorem 
states that if W is a region in IR’, then the flux of a vector field F outward across the 
closed surface dW is equal to the integral of div F over W. We begin by assuming 
that W is a symmetric elementary region (Figure 5.5.5). 


THEOREM 9: Gauss’ Divergence Theorem Let W be a symmetric 
elementary region in space. Denote by 8 W the oriented closed surface that bounds 
W. Let F be a smooth vector field defined on W. Then 


J[[, «v - ff ras 
[[[ «v = ff eas. 


PROOF IfF = Pi + Qj + Rk, then by definition, the divergence of F is given by 
div F = ðP /ðx + 9Q/8y + GR/8z, so we can write (using additivity of the volume 
integral) 


Jii serere scene ye 


On the other hand, the surface integral in question is 


J| 7 se5- ff ri i monas 
= ff rimas ff oi-nas ff minas. 


The theorem will follow if we establish the three equalities 


If, Pi-ndS = is av, (1) 
[[, eie f[[ 2o : 


or, alternatively, 
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and 


[ff monas - [ff Mar. o 


We shall prove equation (3); the other two equalities can be proved in an analogous 
fashion. 
Because W is a symmetric elementary region, there is a pair of functions 


z=gi(x,y), 2=g2(x,y), 


with common domain an elementary region D in the xy plane, such that W is the set 
of all points (x, y, z) satisfying 


gix, y) <z < g(x,y), Q.yeD. 


By reduction to iterated integrals, we have 


z=g2(x,y) 
{es av= LAE as de dx d, 
2=g1(x,y) 


and so, by the fundamental theorem of calculus, 
[J| av = [ [ noo me o - Roy, e166, de dy.) 


The boundary of W is a closed surface whose top Sz is the graph of z = g»(x, y), 
where (x, y) € D, and whose bottom S| is the graph of z = gi(x, y), (x, y) € D. The 
four other sides of d W consist of surfaces $5, S4, Ss, and Ss, whose normals are always 
perpendicular to the z axis. (See Figure 8.4.4. Note that some of the other four sides 


2 = g(x, y) 


Figure 8.4.4 A symmetric elementary region W for which 
ow RK- aS= fff, (9 R/8z) dV The four sides of AW, which 
are S3, S4, S5, Se have normals perpendicular to the z axis. 


z= g(x,y) 
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might be absent; for instance, if W is a solid ball and 0 W is a sphere.) By definition, 


ff ns - ff pim ds ff Rk- mas+ ff Rk- n; d$. 


Because the normal n; is perpendicular to k on each of 55, S4, Ss, S6, we have k - n = 0 
along these faces, and so the integral reduces to 


[ff si e ff easi + ff mes. (5) 


The surface S; is defined by z — gi(x, y), and 


a 
dS, = (i 8, _ k) asd 
9. oy 


(the negative of the general formula for dS for graphs from Section 7.6, because the 
normal is downward pointing). Therefore, 


If Rk- dS, =- ff R(x, y, gı(x, y))dx dy. (6) 


Similarly, for the top face S5, 


ae. 9 
d$; = ( -22i 787 +k ) ax dy. 
Ox ay 


Therefore, 


i] / Rk- dS, = f i RŒ, y, gale, y) dx dy. 0 
2 D 


Substituting equations (6) and (7) into equation (5) and then comparing with equation 


(4), we obtain 
IIl: adv = J| se mas 


The remaining equalities, (1) and (2), can be established in the same way to complete 
the proof. m 


Generalizing Gauss’ Theorem 


The reader should note that the proof of Gauss’ theorem is similar to that of Green’s 
theorem. By the procedure used in Exercise 8 of Section 8.1, we can extend Gauss’ 
theorem to any region that can be broken up into symmetric elementary regions. This 
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includes all regions of interest to us. An example ofa region to which Gauss' theorem 
applies is the region between two closed surfaces, one inside the other. The surface 
of this region consists of two pieces oriented as shown in Figure 8.4.5. We shall apply 
the divergence theorem to such a region when we prove Gauss' law in Theorem 10. 


Figure 8.4.5 A more general region to which Gauss' 
theorem applies. 


DO 2556) Consider F = 2xi + y?j + z2k. Let S be the unit sphere defined 
by x? + y? +2? = 1. Evaluate [fọ F-n dS. 


SOLUTION By Gauss' theorem, 


ffas- [[ S OH. 


where W is the ball bounded by the sphere. The integral on the right is 


2 fff a vea —2 fff av 2 fff vv 2 fff zar. 


By symmetry, we can argue that fff, y dV = ffi, z dV = 0 (see Exercise 17, Section 
6.3). Thus, because a sphere of radius R has volume 47 R?/3, 


2 fff rena a fff, av=. 


Readers can convince themselves that direct computation of [[; F- nas proves 
unwieldy. A 


DE Use the divergence theorem to evaluate 


n +y+z)d5, 


where W is the solid ball x? + y? +2? < 1. 
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SOLUTION To apply Gauss' divergence theorem, we find a vector field F — 
Fii+ ej + Fk on W with F-n — x? + y +z. At any point (x, y, z) € 9W, the 
outward unit normal n to dW is 


n —xi- yj +zk, 


because on 3 W, x? + y? + z? = 1 and the radius vector r = xi + yj + zk is normal 
to the sphere dW (Figure 8.4.6). 


n=xi+yj+zk 


Figure 8.4.6 n is the unit normal to 3 W, the 
boundary of the ball W. 


Therefore, if F is the desired vector field, then 
F-n = Fix + Foy + Faz. 


We set 


Fx=x°, Ry=y,  Fz-z 


and solve for F;, F}, and F; to find that F = xi + j + k. Computing div F, we get 
divF=1+0+0=1. 


Thus, by Gauss’ divergence theorem, 


n «ads fff dV — volume (W) — a A 


The Divergence as the Flux per Unit Volume 


The physical meaning of divergence is that at a point P, div F(P) is the rate of net 
outward flux at P per unit volume. This follows from Gauss’ theorem and the mean- 
value theorem for integrals: If W, is a ball in IR? of radius o centered at P. then there 
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is a point Q € W, such that 


ll. F-ndS= I div FdV = div F(Q)- volume (W,) 


and so 


; Siit zo d 
div F(P) — limit div F(Q) = lirait VOr,) I. F-nds. 
This is analogous to the limit formulation of the curl given at the end of Section 8.2. 
Thus, if div F(P) > 0, we consider P to be a source, for there is a net outward flow 
near P. If div F(P) < 0, P is called a sink for F. 

A C! vector field F defined on R? is said to be divergence free if div F = 0. If F 
is divergence-free, we have ff, s F -dS = 0 for all closed surfaces S. The converse can 
also be demonstrated readily using Gauss’ theorem: If ff, s F- 4S = 0 for all closed 
surfaces S, then F is divergence-free. If F is divergence-free, we thus see that the 
flux of F across any closed surface S is 0, so that if F is the velocity field of a fluid, 
the net amount of fluid that flows out of any region will be 0. Thus, exactly as much 
fluid must flow into the region as flows out (in unit time). A fluid with this property is 
therefore described as incompressible. 


DONTIBE SS) Evaluate ff, F - dS, where F(x, y, z) = xy?i + x?yj + ykand S 
is the surface of the cylinder x? + y? = 1, bounded by the planes z = 1 and z = —1, 
and including the portions x? + y? « 1 whenz = +1. 


SOLUTION One can compute this integral directly, but it is easier to use the 
divergence theorem. 

Now Sis the boundary of the region W given by x? + y? < 1, —1 < z < 1. Thus, 
Jf, F-4S = fff, (div F) dV. Moreover, 


1 
II ivFyav = ff G^ +y)dedyde = Í ([ G^ y)dsdy)a: 
Ww LA —1 Nx*y?«l 


= 2ff (x? + y?) dx dy. 
x24y2<1 


Before evaluating the double integral, we note that the surface integral satisfies 


I F-nas=2 [f (x? + y?) dx dy > 0. 
aw x24y2<1 


This means that ff, F - dS, the net flux of F out of the cylinder, is positive. 
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We change variables to polar coordinates to evaluate the double integral: 
x =rcosé, y —rsin6, 0<r<l, 0<6<2rn. 


Hence, we have (x, y)/8(r, 0) = r and x? + y? = r?. Thus, 


2n 1 
if 6^ exa = | (f Par) do = sn. 
x?+y? <1 0 0 2 


Therefore, fff, divFdV =x. A 


As we remarked earlier, Gauss' divergence theorem can be applied to regions in space 
more general than symmetric elementary regions. To conclude this section, we shall 
use this observation in the proof of the following important results. 


THEOREM 10: Gauss’ Law Let M be a symmetric elementary region 
in R2. Then if (0, 0, 0) g 3M, we have 


4r if (0, 0, 0) € M 
aM rd 0 if (0,0,0) g M, 
where 


r(x, y, z) = xi-- yj +zk 


and 


r(x, y, z) = In, y, z)l = Vx? + y? +22. 


PROOF OF GAUSS’ LAW First suppose (0,0,0) ¢ M. Then r/r? is a C! 
vector field on M and 0 M, and so by the divergence theorem, 


M dig 


But V - (r/r?) = 0 for r Æ 0, as the reader can easily verify (see Exercise 30, Sec- 


tion 4. 4) Thus, 
II 7 
M 


8.4 Gauss' Theorem S71 


Now let us suppose (0, 0, 0) € M. We can no longer use the preceding method, 
because r/7? is not smooth on M, on account ofthe zero denominator at r — (0, 0, 0). 
Because (0, 0, 0) € M and (0, 0, 0) ¢ 3M, there is an € > 0 such that the ball N of 
radius € centered at (0, 0, 0) is contained completely inside M. Let W be the region 
between M and N. Then W has boundary 3N U 3M = S. But the orientation on 3N 
induced by the outward normal on W is the opposite of that obtained from N (see 
Figure 8.4.7). 


Figure 8.4.7 Induced outward orientation on S; 
W is M minus the ball N. 


Now V- (r/r?) = 0 on W, and so, by the divergence theorem applied to the 
(nonelementary) region W, 


[fishes [fe Qn 
[t f, Ste [tn 


where n is the outward normal to S, we have 


jl aie erus 


However, on QN, n = —r/r and r = e, because 3N is a sphere of radius e, so 


that 
e 1 
dS = d$2— dS. 
UE r? Le all, 


But [jy dS = 4r &, the surface area of the sphere of radius €. This proves the 
result. W 


Because 
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PGW IQ) Gauss’ law has the following physical interpretation. The potential 
due to a point charge Q at (0, 0, 0) is given by 


E PS E 
4nr An x? +y? +22" 


and the corresponding electric field is 


E=-V¢= 2 (5). 


r3 


$(x, y,z) = 


Thus, Theorem 10 states that the total electric flux Ja m E: dS (that is, the flux of E 
out of a closed surface 9M) equals Q if the charge lies inside M and zero otherwise. 
Note that even if (0, 0, 0) ¢ M, E will still be nonzero on M. 

For a continuous charge distribution described by a charge density p in a region 
W, the field E is related to the density p by 


div E 2 V. E-— p. 


Thus, by Gauss' theorem, 


fh,- [fnr 


that is, the flux out of a surface is equal to the total charge inside. A 


Divergence in Spherical Coordinates 


We next use Gauss' theorem to derive the formula 


ð Fo 
dvF= T or, Fan ag agent + 30 (8) 


for the divergence of a vector field F in spherical coordinates, which was stated in 
Section 8.2. (Again, the subscripts here denote components, not partial derivatives.) 
The method is to use the formula 


div F(P) — limit z — o f. E (9) 


where W is a region with volume V(W ), which shrinks down to a point P (in the 
main text we took a ball, but one can use regions of any shape). Let W be the shaded 
region in Figure 8.4.8. 
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p sin ġ d0 


Figure 8.4.8 Infinitesimal volume determined by 
dp, d0, d$ at (p, 0, $). 


sin $ d0 
Forthetwo faces orthogonal to the radial direction, the surface integral in equation 
(9) is, approximately, 
F(p +dp, $, 0) x (area of outer face) — F,(p, ¢, 0) x (area of inner face) 


& F,(p + dp, $, 6Y(p + dpY sing d$ d6 — Fy(p, $, 0)? sing d d 


x 5 ur sing) dp d$ d0 (10) 


by the one-variable mean-value theorem. Dividing by the volume of the region W, 
namely, p? sin dp d$ d0, we see that the contribution to the right-hand side of 
equation (9) is 


1 a 
-a77 0^ Fp) (11) 
for these faces. Likewise, the contribution from the faces orthogonal to the ¢ direction 
is 


1 o EN 1 OF; 
"ud aoe oF); and for the 0 direction, aang. aa 


Substituting (11) and these expressions in equation (9) and taking the limit gives 
equation (8). 


EXERCISES 


1. Use the divergence theorem to calculate the flux of F = (x — y)i+ (y — z)j + (z — x)k 
out of the unit sphere. 


2. Let F = x?i + ^j + z?k. Evaluate the surface integral of F over the unit sphere. 
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3. Evaluate Dow F - dS, where F = xi + yj + zk and W is the unit cube (in the first octant). 
Perform the calculation directly and check by using the divergence theorem. 


4. Repeat Exercise 3 for 


(a) F=i+j+k 
(b) F 2 x?i - x?j +27k 


5. Let F = yi + zj + xzk. Evaluate ff, F - dS for each of the following regions W: 


(a) x!» xzzl 
(b) x? +y? <z < landx 20 
(c) x? +y? <z x 1andx <0 


6. Repeat Exercise 5 for F = (x — y)i + (y — z)j + (z — x)k. [The solution to part (b) only 
is in the Study Guide to this text.] 


7. Find the flux of the vector field F = (x — y?)i + yj + x°k out of the rectangular solid 
[0, 1] x [1, 2] x [1, 4]. 


8. Evaluate ff F - dS, where F = 3xy?i + 3x?yj + z°k and 5 is the surface of the unit 
sphere. 


9. Evaluate dag: F-ndA, where F(x, y, z) = xi + yj — zk and W is the unit cube in the 
first octant. Perform the calculation directly and check by using the divergence theorem. 


10. Evaluate the surface integral ff} ç F - n d 4, where F(x, y, z) = i + j + z(x? + yk and 
9S is the surface of the cylinder x? + y? < 1,0 < z < 1. 


11. Prove that 


[[[ nana = ff fr-nas- fff rv-Rax dyads. 


12. Prove the identity 
V-(F x G)=G-(V x F)—F-(V xG). 
13. Show that f[f,,(1/r?) dx dy dz = [fyy(r+m/r?) dS, where r = xi + yj + zk. 


14. Fix vectors vi, ..., v € IR? and numbers ("charges") q1, ... , qx. Define the function $ 
by (x, y, z) = E$ qi/ (47. llr — vill), where r = (x, y, z). Show that for a closed surface S 


and E = —V@, 
[[9-5-9. 
S 


where Q is the total charge inside S. (Assume that Gauss' law from Theorem 10 applies and 
that none of the charges are on S.) 
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15. Prove Green's identities 


[[_rvenas= fff re mr mv 
Vg—gVf):nd$ = (f V?g — gV?f)dV. 
fh veras - f] [rta m 


16. Suppose F satisfies div F = 0 and curl F = 0 on all of R?. Show that we can write 
F — Vf, where V? f — 0. 


and 


17. Let p be a continuous function on IR? such that p(q) = 0 except for q in some region W. 
Let q € W be denoted by q — (x, y, z). The potential of p is defined to be the function 


pq) 
ow = [I]. ipa LIAC 


where ||p — q|| is the distance between p and q. 
(a) Using the method of Theorem 10, show that 


[levees fen 


for those regions W that can be partitioned into a finite union of symmetric elementary 
regions. 
(b) Show that satisfies Poisson's equation 


Vig = —p. 
[HiNT: Use part (a).] (Notice that if p is a charge density, then the integral defining ¢ may 
be thought of as the sum of the potential at p caused by point charges distributed over W 


according to the density p.) 


18. Suppose F is tangent to the closed surface S = 0W of a region W. Prove that 


I (iv dy — 0. 


19. Use Gauss' law and symmetry to prove that the electric field due to a charge Q evenly 
spread over the surface of a sphere is the same outside the surface as the field from a point 
charge Q located at the center of the sphere. What is the field inside the sphere? 

20. Reformulate Exercise 19 in terms of gravitational fields. 


21. Show how Gauss' law can be used to solve part (b) of Exercise 25 in Section 8.3. 


22. Let S be a closed surface. Use Gauss' theorem to show that if F is a C? vector field, then 
we have ff.(V x F)-dS — 0. 
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23. Let S be the surface of region W. Show that 


[fe-nas =3 votume o. 
Ss 


Explain this geometrically. 


8.5 Some Differential Equations of Mechanics and Technology 


Isaac Newton reputedly said, “All in nature reduces to differential equations.” This 
point of view was paraphrased by Max Planck (see the Historical Note in Section 3.3): 
*. .. Present day physics, as far as it is theoretically organized, is completely governed 
by a system of space-time differential equations." 

Inthis section, we apply the central theorems of vector analysis to the derivation of 
the differential equations governing heat transfer, electromagnetism, and the motion 
of some fluids. 

Keep in mind the importance of these problems in modern technology. For exam- 
ple, a good understanding of fluids and the ability to do computations to solve their 
governing equations is at the heart of how one builds a modern airplane or designs 
a submarine. For instance, the flow of air (the fluid in this case) over the wings of 
an aircraft is very subtle, even though the governing equations are relatively simple. 
We shall derive a slightly idealized form of these equations in this section. Likewise, 
the equations of electromagnetism, as we will discuss in the following paragraphs, is 
central to the communications industry; wireless, television, and much of the opera- 
tion of modern electronic devices, including computers, depends on these and related 
fundamental equations. 


Conservation Laws 


As preparation for deriving the equations of a fluid, let us first discuss an impor- 
tant equation that is referred to as a conservation equation. For fluids, it expresses 
the conservation of mass; for electromagnetic theory, it expresses the conservation 
of charge. We shall apply these ideas to the equation for heat conduction and to 
electromagnetism. 

Let V(t, x, y, z) bea C! vector field on IR? for each ¢ and let p(t, x, y, z) bea C! 
real-valued function. By the Jaw of conservation of mass for V and p, we mean that 


the condition 
al] -- Ji 
— dV =— J-ndS 
dt w " aw 


holds for all regions W in IR?, where J = pV (see Figure 8.5.1). 

If we think of p as a mass density (p could also be a charge density)—that is, the 
mass per unit volume—and of V as the velocity field of a fluid, the condition simply 
says that the rate of change of total mass in W equals the rate at which mass flows 
into W. Recall that Ih y J: n dS is called the flux of J. We need the following result. 
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J 


mass in w=fffye dx dy dz 
J-n = mass flowing 


out of W per unit Figure 8.5.1 The rate of change 
area per unit time of mass in W equals the rate at 
which mass crosses ð W. 


THEOREM 11 For V and p (a smooth vector field and a scalar field on R°), the 
law of conservation of mass for V and p is equivalent to the condition 


i dp 
divJ+— =0. 1 
iv J+ àr (1) 
That is, 
E dp 
PEVE Vp = 0. (15 


Here, div J means that we compute div J for t held fixed, and dp/dt means we 
differentiate p with respect to ¢ for x, y, z fixed. 


PROOF First, observe that by differentiating under the integral, we get 


EI, pdxdydz = fff ? Fy dx dy dz 
If, 57 fff, divJ dV 


by the divergence theorem. Thus, conservation of mass is equivalent to the condition 


Ill, (avs 3 2) ds yas - 


Because this is to hold for all regions W, it is equivalent to div J + 9p/0t = 0. m 


and also 


The equation div J + 9p/8t = 0 is called the equation of continuity. An inter- 
esting remark is that using the change of variables formula, the law of conservation 
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of mass may be shown to be equivalent to the condition 


d 
— dV — 0, 
dt Ife 


where W, is the image of W obtained by moving each point in W along flow lines of V 
for time ¢. This result is a special case of the transport theorem that we discuss next. 


'The Transport Theorem 


The transport theorem is an interesting application of the divergence theorem that 
will be needed in our derivation of the equations of a fluid. 


FHEOREM 12 Let F be a vector field on R? and denote the flow line of F start- 
ing at x after time t by ó(x, t). (See the Internet supplement to Section 4.4 for more 
information.) Let /(x, t) be the Jacobian of the map gy: x > (x, t) for t fixed. Then 


ae. t) = [div F(ó(x, 0))]J(x, t). 


For a given function f(x, y, z, t) and aregion W C R2, the transport equation holds: 


d E Df : 
ral i sayz ndedyas = [ff (x + f div F) dx dy dz, 


where W, = $,(W ), which is the region moving with the flow, and where 


Df _ 
my = affat+ Vf. F 


is the material derivative. 


Taking f — 1, Theorem 12 implies that the following assertions are equivalent 
(which justifies the use of the term incompressible): 


l. divF=0 
2. volume (W,) = volume (W) 
3. J(&t)sl 


Let $, J, F, f be as just defined. There is also a vector form of the transport 
theorem, namely, 


d 
as (fF) dx dy dz 


= If [UPH vum eom div r] dx dy dz, 
W, 
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where F - V( f F) denotes the 3 x 3 derivative matrix D( f F) operating on the column 
vector F; in Cartesian coordinates, F - VG is the vector whose ith component is 


" . : : 
8G! agi agi 8G! 
F}—- =F, F. FI. 
HI Vag "fray a 
j=l 


We shall leave the proofs of these results, which are extensions of the arguments used 
to prove Theorem 11, to the reader (see the exercises). 


Derivation of Euler’s Equation of a Perfect Fluid 


A portion of OW 


The continuity equation is not sufficient to completely determine the motion of a 
fluid—we need other conditions. 

The fluids that the continuity equation governs can be compressible. If div V = 0 
(incompressible case) and p is constant, equation (1^) follows automatically. But in 
general, even for incompressible fluids, the equation is not automatic, because p can 
depend on (x, y, z) and t. Thus, even if the equation div V = 0 holds, div (pV) 4 0 
may still be true. 

Here we discuss Euler’s equation for a perfect fluid. Consider a nonviscous fluid 
moving in space with a velocity field V. When we say that the fluid is perfect, we 
mean that if W is any portion of the fluid, forces of pressure act on the boundary of 
W along its normal. We assume that the force per unit area acting on dW is —pn, 
where p(x, y, z, f) is some function called the pressure (see Figure 8.5.2). Thus, the 
total pressure force acting on W is 


Faw = force --ff pn dS. 
aw 


The forces exerted on W " " 
by the fluid occur across Figure 8.5.2 The force acting on dW per 


AW in the direction n. unit area is — pn. 


This is a vector quantity; the ith component of Faw is the integral of the ith 
component of pn over the surface dW (this is therefore the surface integral of a 
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real-valued function). If e is any fixed vector in space, we have 


Ey-e- - ff pe-ndS, 
aw 


which is the integral of a scalar over ð W. By the divergence theorem and identity (7) 
in the table of vector identities (Section 4.4), we get 


E-Fw - - ff div (pE)dx dy dz =- ff (grad p) -Edx dy dz, 
w w 
so that 


Foy = - fff Vpdx dy dz. 
wW 


Now we apply Newton's second law to a moving region W,. As in the transport 
theorem, W, = $,(W), where ¢;(x) = (x, t) denotes the flow of V. The rate of change 
of momentum of the fluid in W, equals the force acting on it: 


< fff pV dx dydz = Fw, = [ff Vp dx dy dz. 
at JJ Jy, i 


We apply the vector form of the transport theorem to the left-hand side to get 


ni [oo ev vtov pv aiv v|axayaz=- fff Vpdx dy dz. 
w, Lot W 


Because W, is arbitrary, this is equivalent to 
a A 
a; o +V-V(pV)+ pV div V = —Vp. 


Simplification using the equation of continuity, namely, formula (1^), gives 


av 
Ze +v-vv) =—Vp. (2) 
This is Euler’s equation for a perfect fluid. For compressible fluids, p is a given 
function of p (for instance, for many gases, p = Ap” for constants A and y). On the 
other hand, if the fluid is incompressible, p is to be determined from the condition 
div V — 0. Equations (1) and (2) then govern the motion of the fluid. 
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Un M bau oue 


The equations describing the motion of a fluid were first derived by 
Leonhard Euler in 1755, in a paper entitled "General Principles of the 
Motion of Fluids." Euler did basic work in mechanics as well as voluminous 
work in pure mathematics, a small part of which has already been discussed 
in this book; he essentially began the subject of analytical mechanics (as 
opposed to the Euclidean geometric methods used by Newton). He is 
responsible for the equations of a rigid body (equations that apply, for 
example, to a tumbling satellite) and the formulation of many basic equations 
of mechanics in terms of variational principles; that is, by the methods of 
maxima and minima of real-valued functions. Euler wrote the first 
comprehensive textbook on calculus and contributed to virtually all branches 
of mathematics. He wrote several books and hundreds of research papers 
even after he became totally blind, and he was working on a new treatise on 
fluid mechanics at the time of his death in 1783. Euler'sequations for a fluid 
were eventually modified by Navier and Stokes to include viscous effects; the 
resulting Navier-Stokes equations are described in virtually every textbook 
on fluid mechanics." Stokes is, of course, also responsible for developing 
Stokes' theorem, one of the main results discussed in this text! 


Conservation of Energy and the Derivation of the Heat Equation 


If T(t, x, y, z) (a C? function) denotes the temperature in a body at time t, then VT 
represents the temperature gradient: Heat “flows” with the vector field — VT = F. 
Note that VT points in the direction of increasing T. Because heat flows from hot 
to cold, we have inserted a minus sign to reflect this physically observable fact. The 
energy density, that is, the energy per unit volume, is coo T, where c is a constant 
(called the specific heat) and oo is the mass density, assumed constant. (We accept 
these assertions from elementary physics.) The energy flux vector is defined to be 
J = kF, where k is a constant called the conductivity. 

One now makes the hypothesis that energy is conserved. This means that J and 
p = cpoT should obey the law of conservation of mass, with p playing the role of 
“mass” (note that it is energy density, not mass); that is, 


t fon [nns 


By Theorem 11, this assertion is equivalent to 


à 
div J+ £ — 0. 
a 


"The Clay Foundation has offered a prize of $1 million to anyone who shows that for the incompressible Navier-Stokes 
equations, smooth data at ¢ = 0 lead to smooth solutions for all t > 0. 
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But 
div J = div(—kVT) = —V?T. 
(Recall that V? T = 9?T/8x? + 9? T/8y? + 9? T /z? and V? is the Laplace operator.) 


Continuing, we have 
9p _ A(cpoT) T" oT 
at 0t. — 8t 


Thus, the equation div J + 9p/8t = 0 becomes 


9T vor S ky?T, (3) 
ot €po 
where x = k/cpo is called the diffusivity. Equation (3) is the important heat equation. 
Just as equations (1) and (2) govern the flow of an ideal fluid, equation (3) 
governs the conduction of heat in the following sense. If T(0, x, y, z) is a given 
initial temperature distribution, then a unique 7 (f, x, y, z) is determined that satisfies 
equation (3). In other words, the initial condition at t = 0 gives the result for t > 0. 
Notice that if T does not change with time (the steady-state case), then we must have 
V?T — 0 (Laplace's equation). 


Maxwell’s Equations and the Prediction of Radio Waves: 
The Communication Revolution Begins 


We now return to Maxwells equations, which govern the propagation of electro- 
magnetic fields. The form of these equations depends on the physical units one is 
employing, and changing units introduces factors like 47 and the velocity of light. 
We shall choose the system in which Maxwell's equations are simplest. 

Let E and H be C! functions of (t,x, y, z) that are vector fields for each t. 
They satisfy (by definition) Maxwell’s equation with charge density p(t, x, y, z) and 
current density J(t, x, y, z) when the following conditions hold: 


V. E = p (Gauss’ law), (4) 
V - H = 0 (no negative sources), (5) 
VxE+ 2 — 0 (Faraday's law), (6) 
and 
VxH- = = J (Ampére’s law), (7) 


Of these laws, equations (4) and (6) were described in integral form in Sections 8.2 
and 8.4; historically, they arose in these forms as physically observed laws. Ampére’s 
law was mentioned for a special case in Section 7.2, Example 12. 
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Physically, one interprets E as the electric field and H as the magnetic field. 
According to the preceding equations, as time / progresses, these fields interact with 
each other, and with any charges and currents that are present. For example, the 
propagation of electromagnetic waves (TV signals, radio waves, light from the sun, 
etc.) in a vacuum is governed by these equations with J = 0 and p = 0. 

Because V - H — 0, we can apply Theorem 8 (from Section 8.3) to conclude that 
H — V x A for some vector field A. (We are assuming that H is defined on all of 
IR? for each time /.) The vector field A is not unique, and we can use A’ = A+ V f 
equally well for any function f(t, x, y, z), because V x V f = 0. (This freedom in 
the choice of A is called gauge freedom.) For any such choice of A, we have, by 
equation (6), 


oH a 
0=VxE+ =VxE+—VxA 
ot or 


ðA 
=VxEx Vx — 
at 


Applying Theorem 7 (from Section 8.3), there is a real-valued function ¢ on IR? such 
that 


Substituting this equation and H = V x A into equation (7), and using the vector 
identity (whose proof we leave as an exercise) 


V x (V x A) = V(V - A) — VA, 


we get 
dE 8 ( 9A 
=VxH-—=Vx(VxA)-—(-—-V 
SST Se ar z at s) 
A a 
= V(V-A) — VA Vo). 
(V-A) +o5 tu 
Thus, 


07A a 
VA — — =-J+ V(V- A) + —(V9). 
J2 +V(V +A) +O 
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That is, 
vA a 
vA- Sea 4 0 (vate), (8) 


Again using the equation E + 0A/dt = —V¢ and the equation V - E = p, we obtain 


aA a(V - A) 
=V.E=V. (-V¢ - — | = -V?$ - ———.. 
P ( $ x) at 


That is, 


(V-A 
vg = —p - EA, O) 


Now let us exploit the freedom in our choice of A. We impose the “condition” 


ag — 
VA =0. (10) 


We must be sure we can do this. Supposing we have a given Ao and a corresponding ġo, 
can we choose a new A = Ao + Vf and then a new ¢ such that V - A + 3¢ġ/3t = 0? 
With this new A, the new $ is ¢o — 0f/dt; we leave verification as an exercise for the 
reader. Condition (10) on f then becomes 


alpo — af/d ð ð 
07 vty = OD vas ILE 
or 
PS Ago 
2 = . 
V s (v Aot ux ) (11) 


Thus, to be able to choose A and ¢ satisfying V - A + 0/dt = 0, we must be 
able to solve equation (11) for f. One can indeed do this under general conditions, 
although we do not prove it here. Equation (11) is called the inhomogeneous wave 
equation. 

If we accept that A and $ can be chosen to satisfy V - A + 0¢/dt = 0, then 
equations (8) and (9) for A and @ become 


aA 

VA 38 775 (8) 
ao 

Vo- —. = —p. (9^) 
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Conversely, if A and @ satisfy the equations V -A + 09/81 = 0, V? — 
8?$/81? = —p, and V?A — 0?A/at? = —J, then E = —V$ — 8A/8t and H = 
V x A satisfy Maxwell's equations. This procedure then “reduces” Maxwell's equa- 
tions to a study of the wave equation.* 

Since the eighteenth century, solutions to the wave equation have been well 
studied (one learns these in most courses on differential equations). To indicate the 
wavelike nature of the solutions, for example, observe that for any function f, 


olt, x,y,z) = f(x — t) 


solves the wave equation V7 — (029/91?) = 0. This solution just propagates the 
graph of f like a wave; thus, one might conjecture that solutions of Maxwell's equa- 
tions are wavelike in nature. Historically, all of this was Maxwell's great achievement, 
and it soon led to Hertz's discovery of radio waves. To quote from the Feynman Lec- 
tures on Physics (Vol. II): 


From a long view of the history of mankind—seen from, say, ten thousand 
years from now—there can be little doubt that the most significant event 
of the nineteenth century will be judged as Maxwell's discovery of the 
laws of electrodynamics. The American Civil War will pale into provincial 
insignificance in comparison with this important scientific event of the 
same decade. 


Mathematics again shows its uncanny ability not only to describe but to predict 
natural phenomena. 

There are other techniques (called Green's function methods) for dealing with 
the basic equations of mechanics and mathematical physics that also rely on vector 
calculus. Some of these methods are discussed in the Internet supplement for this 
book. 


EXERCISES 
1. Use a direct argument (or the proof of Theorem 1 in the Internet supplement to Section 


4.4) to show that 


ENS 1) = [div F(@(x, 1))]J(x, t). 


"There are variations on this procedure. For further details, see, for example, Differential Equations of Applied Mathe- 
matics, by G. F. D. Duff and D. Naylor, Wiley, New York, 1966, or books on electromagnetic theory, such as Classical 
Electrodynamics, by J. D. Jackson, Wiley, New York, 1962. 


586 


The Integral Theorems of Vector Analysis 


2. Using the change of variables theorem and Exercise 1, show that if f(x, y, z, t) isa 
given function and W C R? is any region, then the transport equation holds: 


á Aff 1e - fff (26 + sain) acras 


where W, = $,(W), which is the region moving with the flow, and where Df/ Dt = 0f/dt + 
Vf -F is the material derivative. 


3, Use the transport equation to show that 


d 
= dxdydz=0 
a JJ one 


is equivalent to the law of conservation of mass. 


4. Using Exercise 3 and the change of variables theorem, show that p(x, t) can be expressed 
in terms of the Jacobian J(x, £) of the flow map #(x, t) and p(x, 0) by the equation 


P(x, t)J(x, t) = p(x, 0). 
What can you conclude from this for incompressible flow? 


5. Prove the vector form of the transport theorem, namely, 


Ml [Paxavaz= fff [mer vom eom div F |ax dyaz, 


where F - V( fF) denotes the 3 x 3 derivative matrix D (fF) operating on the column vector 
F; in Cartesian coordinates, F - VG is the vector whose ith component is 


3 7 ; ; ; 
aG' act 0G! aG’ 

Fj F F, F; 
M bae tray t Pa, 


6. Let V be a vector field with flow (x, t) and let V and p satisfy the law of conservation 
of mass. Let W, be the region transported with the flow. Prove the following version of the 


transport theorem: 
d Df 
asd dz — dx dy dz. 
g [[[, orsa: Med x dy dz 


7. (Bernoulli law) (a) Let V, p satisfy the law of conservation of mass and equation (2) 
(Euler's equation for a perfect fluid). Suppose V is irrotational and hence that V = V@ for a 
function $. Show that if C is a path connecting two points P, and P5, then 


ag ı jr [2 
L—— 4 IV +| —=0. 
(2 ri | t Jep 
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[Hint: You may need the vector identity, (V - V)V = 5 V(IVIP?) + (V x V) x V.] 
(b) If in part (a), V is stationary—that is, 9V/8t = 0—Aand p is constant, show that 


1 
svi + 2 
2 p 


is constant in space. Deduce that, in this situation, higher pressure is associated with lower 
fluid speed. 


8. Using Exercise 7, show that if $ satisfies Laplace's equation V?$ = 0, then V = Vó isa 
stationary solution to Euler's equation for a perfect incompressible fluid with constant density. 


9. Verify that Maxwell's equations imply the equation of continuity for J and p. 


10. Fora steady-state charge distribution and divergence-free current distribution, the 
electric and magnetic fields E(x, y, z) and H(x, y, z) satisfy 


VxE=0, V-H=0, V-J=0, V-E=p, and VxH-J. 


Here p — p(x, y, z) and J(x, y, z) are assumed to be known. The radiation that the fields 
produce through a surface S is determined by a radiation flux density vector field, called the 
Poynting vector field, 


P=ExH. 


(a) If S is a closed surface, show that the radiation flux—that is, the flux of P through 


S—is given by 
[[e-as=- fff ee. 
s y 


where V is the region enclosed by S. 
(b) Examples of such fields are 


E(x,y,z)—zj-c yk and — H(x,y,z) 2 —xyi 4 xj - yzk. 


In this case, find the flux of the Poynting vector through the hemispherical shell shown in 
Figure 8.5.3. (Notice that it is an open surface.) 


x24 y24 22225 


Figure 8.5.3 The surface for Exercise 10. 
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(c) The fields of part (b) produce a Poynting vector field passing through the toroidal 
surface shown in Figure 8.5.4. What is the flux through this torus? 


Figure 8.5.4 The surface for Exercise 10(c). 


8.6 Differential Forms 


The theory of differential forms provides an elegant way of formulating Green's, 
Stokes', and Gauss' theorems as one statement, the fundamental theorem of cal- 
culus. The birth of the concept of a differential form is another dramatic example of 
how mathematics speaks to mathematicians and drives its own development. These 
three theorems are, in reality, generalizations of the fundamental theorem of calculus 
of Newton and Leibniz for functions of one variable, 


b 
I f'G)dx = f(b) — f(a) 


to two and three dimensions. 

Recall that Bernhard Riemann created the concept of n-dimensional spaces. If 
the fundamental theorem of calculus was truly fundamental, then it should generalize 
to arbitrary dimensions. But wait! The cross product, and therefore the curl, does not 
generalize to higher dimensions, as we remarked in footnote 3, in Section 1.3. Thus, 
some new idea is needed. 

Recall that Hamilton searched for almost 15 years for his quaternions, which 
ultimately led to the discovery of the cross product. What is the nonexistence of a 
cross product in higher dimensions telling us? If the fundamental theorem of calculus 
is the core concept, this suggests the existence of a mathematical language in which it 
can be formulated in n-dimensions. In order to achieve this, mathematicians realized 
that they were forced to move away from vectors and on to the discovery of dual 
vectors and an entirely new mathematical object, a differential form. In this new 
language, all of the theorems of Green, Stokes, and Gauss have the same elegant and 
extraordinarily simple form. 

Simply and very briefly stated, an expression of the type P dx + Qdy is a 
1-form, or a differential one-form on a region in the xy plane, and F dx dy is a 2-form. 
Analogously, one can define the notion of an n-form. There is an operation d, which 


0-Forms 
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takes n-forms to n + 1-forms. It is like a generalized curl and has the property that 
for w = Pdx + Q dy, we have 


do = (2 - aP) axay 
ax oy 


and so in this notation, Green’s theorem becomes 


f w= [ do. 
aD D 


which, interestingly, just switches the boundary operator ð with the d operator. How- 
ever, differential forms are more than just notation. They create a beautiful theory 
that generalizes to n-dimensions. 

In general, if M is an oriented surface of dimension n with an (n — 1)-dimensional 
boundary 0M and if w is an (n — 1)-form on M, then the fundamental theorem of 
calculus (also called the generalized Stokes’ theorem) reads 


f w= f do. 
aM M 


A useful thing for the reader to contemplate at this stage is the sense in which 
the fundamental theorem of calculus becomes a special instance of this result. 

In this section, we shall give a very elementary exposition of the theory of forms. 
Because our primary goal is to show that the theorems of Green, Stokes, and Gauss can 
be unified under a single theorem, we shall be satisfied with less than the strongest 
possible version of these theorems. Moreover, we shall introduce forms in a purely 
axiomatic and nonconstructive manner, thereby avoiding the tremendous number 
of formal algebraic preliminaries that are usually required for their construction. 
To the purist our approach will be far from complete, but to the student it may be 
comprehensible. We hope that this will motivate some students to delve further into 
the theory of differential forms. 

We shall begin by introducing the notion of a 0-form. 


Let K be an open set in R°. A 0-form on K is a real-valued function f: K — R. When 
we differentiate f once, it is assumed to be of class C', and C? when we differentiate 
twice. 

Given two 0-forms f; and f? on K, we can add them in the usual way to get a 
new 0-form fı + f? or multiply them to get a 0-form fi f2. 


PGW OY f(x,y,z)—xy-yz and f(x, y,z)= ysinxz are 0-forms 


on R°: 


(fit Ae, yz) — xy + yz + ysinxz 
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and 


(i fo)Gr, yz) = y!xsinxz + y?zsinxz. A 


The basic 1-forms are the expressions dx, dy, and dz. At present we consider these to 
be only formal symbols. A 1-form w on an open set K is a formal linear combination 


w = P(x,y,z)dx + Q(x, y, z)dy + R(x, y, z) dz, 
or simply 
w = Pdx + Qdy + Rdz, 
where P, Q, and R are real-valued functions on K. By the expression P dx we mean 
the I-form P dx + 0- dy + 0- dz, and similarly for Q dy and R dz. Also, the order 
of P dx, Q dy, and R dz is immaterial, and so 


Pdx+Qdy+Rdz= Rdz + P dx + Qdy,etc. 


Given two 1-forms o, = Pı dx + Qı dy + Ri dz ando? = P5 dx + Q2dy + R2 dz, 
we can add them to get a new 1-form @ + a defined by 


@ + €» = (Pı + Pr) dx + (Qi + Q2)dy + (Ri + R2)dz, 
and given a 0-form f, we can form the 1-form fw, defined by 
fo = (fPi) dx + (Q1) dy + (fRi) dz. 


DAVI BP Let wy = (x + y?) dx + (zy) dy + (e7) dz and v» = sin y dx + 
sinx dy be 1-forms. Then 


wi +a. = (x + Y? + sin y) dx + (zy + sinx)dy + (e?) dz. 
If f(x, y, z) — x, then 


fo,=xsinydx+xsinxdy. A 


The basic 2-forms are the formal expressions dx dy, dy dz, and dzdx. These 
expressions should be thought of as products of dx and dy, dy and dz, and dz 
and dx. 
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A 2-form 1) on K is a formal expression 
n = Fdxdy-- Gdydz + H dzdx, 


where F, G, and H are real-valued functions on K. The order of F dx dy, G dy dz, 
and H dz dx is immaterial; for example, 


F dx dy + G dy dz + H dz dx = Hdzdx + F dx dy + G dy dz, etc. 


At this point it is useful to note that in a 2-form the basic 1-forms dx, dy, and dz 
always appear in cyclic pairs (see Figure 8.6.1), that is, dx dy, dy dz, and dz dx. 


Figure 8.6.1 The cyclic order of dx, dy, and dz. 


By analogy with 0-forms and 1-forms, we can add two 2-forms 
ni = F; dx dy + G; dy dz + H; dz dx, 
i = | and 2, to obtain a new 2-form, 
m +m = (Fi + Fy) dx dy + (Gi + G2) dy dz + (Hi + Hn) dz dx. 
Similarly, if f is a 0-form and 7 is a 2-form, we can take the product 
fn = (fF) dx dy + ({G) dy dz + (fH) dz dx. 


Finally, by the expression F dx dy we mean the 2-form F dx dy +0+dydz+ 
0-dzdx. 


DE] The expressions 


m =x? dx dy + y)x dy dz + sin zy dz dx 
and 


m = ydy dz 
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are 2-forms. Their sum is 
m +m = x^ dx dy + (y)x + y)dy dz + sinzy dz dx. 
If f(x, y, z) = xy, then 


fm —xy!dydz. A 


A basic 3-form is a formal expression dx dy dz (in this specific cyclic order, as 
in Figure 8.6.1). A 3-form v on an open set K C R? is an expression of the form 
v = f(x, y, z) dx dy dz, where f is a real-valued function on K. 

We can add two 3-forms and we can multiply them by 0-forms in the obvious 
way. There seems to be little difference between a 0-form and a 3-form, because both 
involve a single real-valued function. But we distinguish them for a purpose that will 
become clear when we multiply and differentiate forms. 


ID CNN] Let vı —ydxdydz, v; = e" dx dydz, and f(x, y, z) = xyz. 
Then v + v = (y +e) dx dy dz and fv = y?xzdx dydz. A 


Although we can add two 0-forms, two 1-forms, two 2-forms, or two 3-forms, 
we do not need to add a k-form and a j-form if k Æ j. For example, we shall not need 
to write 


fx, y, z) dx dy + g (x, y, z) dz. 


Now that we have defined these formal objects (forms), one can legitimately ask 
what they are good for, how they are used, and, perhaps most important, what they 
mean. The answer to the first question will become clear as we proceed, but we can 
immediately describe how to use and interpret them. 

A real-valued function on a domain K in R? is a rule that assigns a real number 
to each point in K. Differential forms are, in some sense, generalizations of the real- 
valued functions we have studied in calculus. In fact, 0-forms on an open set K are 
just functions on K. Thus, a 0-form f takes points in K to real numbers. 

We should like to interpret differential k-forms (for k 7 1) not as functions on 
points in K, but as functions on geometric objects such as curves and surfaces. Many 
of the early Greek geometers viewed lines and curves as being made up of infinitely 
many points, and planes and surfaces as being made up of infinitely many curves. 
Consequently, there is at least some historical justification for applying this geometric 
hierarchy to the interpretation of differential forms. 
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Given an open subset K C IR?, we shall distinguish four types of subsets of K 
(see Figure 8.6.2): 


(i) points in K, 
(ii) oriented simple curves and oriented simple closed curves, C, in K, 
(iii) oriented surfaces, S C K, 


(iv) elementary subregions, R C K. 


K 
Figure 8.6.2 The four geometric types of 
subsets of an open set K C IR? to which the 
theory of forms applies. 


The Integral of 1-Forms Over Curves 


We shall begin with 1-forms. Let 
€ = P(x, y, z)dx + Q(x, y, z) dy + R(x, y, z)dz 


be a 1-form on K and let C be an oriented simple curve as in Figure 8.6.2. The real 
number that w assigns to C is given by the formula 


[o7 [ rea 96.» 24» + Ré, y, zaz. q) 
c c 
Recall (see Section 7.2) that this integral is evaluated as follows. Suppose that 


€: [a, b] > K, e(t) = (x(t), y(t), z(t)) is an orientation-preserving parametrization 
of C. Then 


fe E Í im [ i LO y(t), z() + = 


d d 
+ OEO, »(0). 20) + REO, WO. 20)- =| dt. 


a 


Theorem | of Section 7.2 guarantees that fọ w does not depend on the choice of 
the parametrization c. 
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We can thus interpret a 1-form w on K as a rule assigning a real number to each 
oriented curve C C K; a 2-form 7 will similarly be seen to be a rule assigning a real 
number to each oriented surface $ C K; and a 3-form v will be a rule assigning a real 
number to each elementary subregion of K. The rules for associating real numbers 
with curves, surfaces, and regions are completely contained in the formal expressions 
we have defined. 


Let w = xy dx + y? dy + dz be a 1-form on R? and let C be the 
oriented simple curve in IR? described by the parametrization c(t) = (tê, t°, 1), 0 < 
t € 1. C is oriented by choosing the positive direction of C to be the direction in 
which c(f) traverses C as t goes from 0 to 1. Then, by formula (1), 


fo -f S(t) + (68:2) + O] dt = i (t$ + 3:8)dt = = 


Thus, this 1-form c assigns to each oriented simple curve and each oriented simple 
closed curve C in IR? the number fo w. A 


The Integral of 2-Forms over Surfaces 


A 2-form 7 on an open set K C R? can similarly be interpreted as a function that 
associates with each oriented surface S C K a real number. This is accomplished by 
means of the notion of integration of 2-forms over surfaces. Let 


n = F(x,y,z)dx dy + G(x, y, z) dy dz + H(x, y, z) dz dx 


be a 2-form on K, and let $ C K be an oriented surface parametrized by a function 
$:D— R?, DCR’, (u, v) = (x(u, v), y(u, v), z(u, v)) (see Section 7.3). 


DEFINITION If S is such a surface and r is a 2-form on K, we define Ws n by 
the formula 


[fo-f F dx dy + G dy dz + H dz dx 
s s 


ð 
- ff [Few v), y(u, v), z(u, v))- v mS 
n (2) 
+ G(x(u, v), y(u, v), z(u, v))- d = 
a(z, x) 


+ H(x(u, v), y(u, v), z(u, v))- 


au, 2 iae 
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where 
Ox Ox dy ay az Oz 
a(x, y) _|ðu ðv (yz) _|ðu ðv a,x) _|ðu ðv 
8(u v) jay Oy|' Qv) jðz Oz  Əu,v) |ðx Ax] 
ðu ðv ðu ðv ðu ðv 
If S is composed of several pieces 5;, i = 1,..., k, as in Figure 8.4.4, each with its 


own parametrization ®;, we define 


Ihr- 2h 


One must verify that ff, s ^] does not depend on the choice of parametrization ®. This 
result is essentially (but not obviously) contained in Theorem 4, Section 7.6. 


DONQ IG) Letn = z? dx dy be a 2-form on R?, and let S be the upper unit 
hemisphere in R?. Find ff, 7. 


SOLUTION Let us parametrize S by 
S(u, v) = (sinu cos v, sinu sin v, cos u), 


where (u, v) € D = [0, 2/2] x [0, 277]. By formula (2), 


f[- [f «oe onm 


where 
a(x, y) cosucosv —sinu sin v 
Ə(u, v)  |cosusinv sinucosv 
= sinu cosu cos? v + cos u sinu sin? v = sin u cosu. 
Therefore, 


[fsf] cos? u cos u sinu du dv 
s D 


2n pr/2 2n cos^u z/2 x 
sf f cos! usinu dudv = f [ | dv= >. A 
0 Jo 0 4 Io 2 


EXAMPL Evaluate ff. x dy dz + y dx dy, where S is the oriented surface 
described by the parametrization x = u + v, y = u? — v?, z = uv, where (u, v) € 
D = [0, 1] x [0, 1]. 
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SOLUTION By definition, we have 


9(y,z) _ 2u —2v 


5 ad 
$5.0 ^ [s 2 (u^ + v^) 
acy) |! 1! |-- 

alu, v) |2u -w| 2 v) 


Consequently, 


Í y xdydz + ydx dy = f [(u + v)(2)(u? + v?) + (u? — (72) + v)] du dv 
S D 


1 1 
=4 f (+10?) dudv=4 | [8 tu) dudv 
D 0 0 


=a f [e +] aaa G +2) au 


The Integral of 3-Forms over Regions 


Finally, we must interpret 3-forms as functions on the elementary subregions of K. 
Let v = f(x, y, z) dx dy dz be a 3-form and let R C K be an elementary subregion 
of K. Then to each such R C K we assign the number 


Ilf» [[[ 79. o. (3) 


which is just the ordinary triple integral of f over R, as described in Section 5.5. 


Suppose v = (x -F z) dx dy dz and R = [0,1] x [0, 1] x [0, 1]. 
Evaluate [ffp v. 


SOLUTION We compute: 


fff f[[ oe f [ [entras 
=[ [lz +a] vaf fG +2)dydz= | (5 +2)ae 


fig ET sa d 
B Ble 
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The Algebra of Forms 


We now discuss the algebra (or rules of multiplication) of forms that, together with 
differentiation of forms, will enable us to state Green's, Stokes', and Gauss' theorems 
in terms of differential forms. 

If c» is a k-form and ņ is an /-form on K, 0 < k +1 < 3, there is a product called 
the wedge product w ^ n of œw and 7 that is a k +/ form on K. The wedge product 
satisfies the following laws: 


(i) For each k there is a zero k-form 0 with the property that 0 + w = c for all 
k-forms w and 0 ^ n = 0 for all /-forms n if0 < k +1 <3. 


(ii) (Distributivity) If f is a 0-form, then 
(for + 2) An = flor An) + (92 An). 


(iii) (Anticommutativity) œ ^ n = (—V)" (n ^ c). 


(iv) (Associativity) If cw, @2, œ are kı, k2, ky forms, respectively, with ki + k2 + 
k3 < 3, then 


@ ^ (92 ^ w3) = (w1 ^ 92) ^ %3. 
(v) (Homogeneity with respect to functions) If f is a 0-form, then 
Q ^ Cf) = (fo) ^n = flw ^). 


Notice that rules (ii) and (iii) actually imply rule (v). 


(vi) The following multiplication rules for 1-forms hold: 


dx ^ dy = dx dy 

dy ^ dx = —dxdy = (—1)(dx ^ dy) 

dy ^ dz = dydz =(—1)(dz ^ dy) 

dz ^ dx = dzdx =(—1)(dx ^ dz) 

dx ^ dx =0, dy ^ dy 20, dz ^ dz 0 

dx ^(dy ^ dz) 2 (dx ^ dy) ^ dz = dx dy dz. 


(vii) If f is a 0-form and w is any k-form, then f ^ @ = fo. 


Using laws (i) to (vii), we can now find a unique product of any /-form 7 and any 
k-form o, if0 <k+1 <3. 
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DIAB] Show that dx ^ dydz = dx dy dz. 


SOLUTION By rule (vi), dy dz = dy ^ dz. Therefore, 


dx ^ dydz = dx ^(dy ^ dz) = dxdydz. & 


DGIB If w=xdx+ydy and n=zydx+xzdy+xydz, find 
Q ^I. 


SOLUTION Computing o ^ 7, to get 


OQ ^n = (xdx + ydy) ^ (zydx + xzdy + xy dz) 
= [@ dx + y dy) ^ (zy dx)] + [Œ dx + y dy) ^ (xz dy)] 
+ [(x dx + y dy) ^ (xy dz)] 
= xyz(dx ^ dx) ^- zy (dy ^ dx) + x?z(dx ^ dy) + xyz(dy ^ dy) 
E x?y(dx ^ dz) + xy (dy ^ dz) 
= —zy! dx dy + x?z dx dy — x?y dz dx + xy! dy dz 
= (xz — yz) dx dy — x!y dz dx + xy!dydz. A 


DANIYA Ifc —xdx — ydy and = x dy dz + z dx dy, find w ^ y. 
SOLUTION 


w ^An = (x dx — ydy) ^ (x dy dz + z dx dy) 

= [(x dx — y dy) ^ (x dy dz)] + [(x dx — y dy) ^ (z dx dy)] 

= (x? dx ^ dy dz) — (xy dy ^ dy dz) + (xz dx ^ dx dy) 
— (yzdy ^ dx dy) 

= [x? dx ^ (dy ^ dz)] — [xy dy ^ (dy ^ dz)] + [xz dx ^ (dx ^ dy)] 
— [yz dy ^ (dx ^ dy)] 

=x? dx dy dz — [xy(dy ^ dy) ^ dz] + [xz(dx ^ dx) ^ dy] 
— [»z(dy ^ dx) ^ dy] 

= x? dx dy dz — xy(0 ^ dz) + xz(0 ^ dy) + [vz(dy ^ dy) ^ dx] 

=x?dxdydz. A 


The last major step in the development of this theory is to show how to differentiate 
forms. The derivative of a k-form is a (k + 1)-form if k « 3, and the derivative of a 
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3-form is always zero. If w is a k-form, we shall denote the derivative of w by dw. 
The operation d has the following properties: 


(1) If f:K — Risa 0-form, then 


3 a 
tps ae Laya Lär 
Ox ay üz 


(2) (Linearity) If w and c» are k-forms, then 
d(@ + @2) = dw, + do». 
(3) Ifc is a k-form and 7 is an /-form, 
d(w ^ 1) = (de An) + (- (o ^ dn). 
(4) d(do) = 0 and d(dx) = d(dy) = d(dz) = 0 or, simply, d? = 0. 


Properties (1) to (4) provide enough information to allow us to uniquely differentiate 
any form. 


PON IIH IY) Leto = P(x, y, z) dx + Q(x, y, z) dy bea 1-form on some open 
set K C IR?. Find do. 


SOLUTION 


d[P(x, y, z) dx + Q(x, y, z) dy] 


= d[P(x, y, z) ^ dx] - d[Q(x, y, z) ^ dy] (using 2) 
— (dP ^ dx) - [P ^ d(dx)] - (dQ ^ dy) -[Q ^ d(dy)] (using 3) 
— (dP ^ dx) - (dQ ^ dy) (using 4) 
ðP aP aP 
= E: dx + 3y dy + x de) A dx 
aQ aQ dQ i 
T (2 dx + y? + rm dz) A dy (using 1) 


= (Zan ds) e (Zaya ds) (azn ax) 
ax oy Oz 
aQ aQ aQ 
«(58 dx ^ 4)« (e dy^ a) +(4 dz ^ dy 


Q dx dy ag dy dz 
x Oz 


oP ðP a 
= dx d dzd. 
ay xdy + az ZR Ten 


= (e — z) dx dy + BP zdy = 3B auda: A 
y Oz 0z 
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DEW IERIE) Let f be a 0-form. Using only differentiation rules (1) to (3) and 
the fact that d(dx) = d(dy) = d(dz) = 0, show that d(df) = 


SOLUTION By rule (1), 


df= adr + dy + 2. dr, 


and so 


sensa e (o) 2) 


Working only with the first term, using rule (3), we get 


a(2Las) a(3 » ax) 0 (52) ^ a naan 


ax Ox ax 
vf Pf Pf 
= d. d dz | ^dx 0 
g ME y t azox z) x+ 
f f 
aya dy A dx + 22x dz ^ dx 
af dxdy+ af dzdx 
L—— Z 
ðyðx à ðzðx 


and 


af rf rf 
a( E dz) 3xüz dzdx + ayaz dy dz. 


Adding these up, we get d(df) = 0 by the equality of mixed partial derivatives. A 


POISE SE} Show that d(dx dy), d(dy dz), and d( dz dx) are all zero. 
SOLUTION To prove the first case, we use property (3): 


d(dx dy) = d(dx ^ dy) = [d(dx) ^ dy — dx ^ d(dy)] = 0. 


The other cases are similar. A 
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If n = F(x, yz) dx dy + G(x, y, z)dy dz + H(x, y, z) dz dx, 
find dn. 
SOLUTION By property (2), 
dn = d(F dx dy) + d(G dy dz) + d(H dz dx). 
We shall compute d(F dx dy). Using property (3) again, we get 
d(F dx dy) = d(F ^ dx dy) = dF ^ (dx dy) + F ^ d(dx dy). 


By Example 14, d(dx dy) — 0, so we are left with 


dF ^(dxdy) = (= dx + o dy + "d dz) ^ (dx ^ dy) 


OF OF 
= [aes ^(dx ^ 4] + [Eo ^(dx ^ ay)| 


[ete ts ap]. 
Oz 
Now 
dx ^ (dx ^ dy) = (dx ^ dx) ^ dy =0 Ady =0, 
dy ^ (dx ^ dy) = —dy A (dy ^ dx) 
= —(dy Ady) Adx —-0^dx — 0, 
and 


dz ^ (dx ^ dy) = (—1) (dx ^ dy) ^ dz = dx dy dz. 


Consequently, 
oF 
d(F dx dy) = a dx dy dz. 
Zz 


Analogously, we find that 


H 
d(G dy dz) = ax dy dz and d(H dz dx) = p dx dy dz. 
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Therefore, 


dF 0G 0H 
elc A 
(S + ax + m 2E) dx dydz. 


We have now developed all the concepts needed to reformulate Green’s, Stokes’, 
and Gauss’ theorems in the language of forms. 


THEOREM 13: Green’s Theorem Let D be an elementary region in the xy 
plane, with dD given the counterclockwise orientation. Suppose w = P(x, y)dx + 
Q(x, y)dy is a 1-form on some open set K in R? that contains D. Then 


CE 


Here dw is a 2-form on K and D is in fact a surface in R? parametrized by 
®: D > R?, (x, y) = (x, y, 0). Because P and Q are explicitly not functions of z, 
then 9P/8z and 8 Q/0z = 0, and by Example 12, dw = (8Q/dx — 9P/8y) dx dy. 
Consequently, Theorem 13 means nothing more than that 


[Part oay= If, (Fo - FE) aay, 


which is precisely Green’s theorem. Hence, Theorem 13 holds. Likewise, we have the 
following theorems. 


THEOREM 14: Stokes’ Theorem Let S be an oriented surface in R? with 


a boundary consisting of a simple closed curve dS (Figure 8.6.3) oriented as the 
boundary of S (see Figure 8.2.1). Suppose that w is a 1-form on some open set K that 


contains S. Then 
{ o= i) do. 
as S 


Figure 8.6.3 An oriented surface to which 
Stokes’ theorem applies. 
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THEOREM 15: Gauss’ Theorem Let W C R? bean elementary region with 
dW given the outward orientation (see Section 8.4). If n is a 2-form on some region 


K containing W, then 
Í, JII, 


The reader has probably noticed the strong similarity in the statements of these 
theorems. In the vector-field formulations, we used divergence for regions in IR? 
(Gauss’ theorem) and the curl for surfaces in IR? (Stokes! theorem) and regions in 
IR? (Green's theorem). Here we just use the unified notion of the derivative of a 
differential form for all three theorems; and, in fact, we can state all theorems as one 
by introducing a little more terminology. 

By an oriented 2-manifold with boundary in R? we mean a surface in IR? whose 
boundary is a simple closed curve with orientation as described in Section 8.2. By an 
oriented 3-manifold in R? we mean an elementary region in R? (we assume its bound- 
ary, which is a surface, is given the outward orientation discussed in Section 8.4). We 
call the following unified theorem “Stokes’ theorem,” according to the current con- 
vention. 


THEOREM 16: General Stokes’ Theorem Let M be an ori- 
ented k-manifold in R? (k = 2 or 3) contained in some open set K. Suppose 
w is (k — 1)-form on K. Then 


f v= f dw. 
aM M 


Here the integral is interpreted as a single, double, or triple integral, as is appro- 
priate. In fact, it is this form of Stokes’ theorem that generalizes to spaces of arbitrary 
dimension. 


EXERCISES 


1. Evaluate w ^ n if 


(a) w=2x dx + y dy (d) w=xydydz+x? dx dy 
n — x? dx + y! dy n= dx + dz 

(b) w=xdx — ydy (e) w=% dx dy 
n=ydx+xdy n=e* dz 


(€) w=xdx+ydy+zdz 
n=zdxdy+xdydz+ydzdx 
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2. Prove that 


3 
(a1 dx + a? dy + a3 dz) ^ (bı dy dz + by dz dx + b; dx dy) = (Zon) dx dy dz. 
i=l 


3. Find dw in the following examples: 


(a) o — x!y +y? (e) w = (x? + y?)dy dz 

(b) w = y? cosx dy + xy dx + dz (f) w = (x? + y? - z)az 

(c) œ — xy dy +(x + yy dx _ = y 

(d) w = x dx dy + zdy dz + y dz dx Qaes mapt tug 


(h) w = x?y dy dz 


4. Let V: K — R? be a vector field defined by V(x, y, z) = G(x, y, z)i + H(x, y, z)j + 
F(x, y, z)k, and let ņ be the 2-form on K given by 


n = Fdxdy + Gdydz + H dzdx. 


Show that dy = (div V) dx dy dz. 


5. If V = A(x, y, zji + B(x, y, z)j + C(x, y, z)k is a vector field on K C R?, define the 
operation Formz: Vector Fields — 2-forms by 


Form;(V) = A dy dz + B dz dx + C dx dy. 


(a) Show that Form;(a V; + V2) = a Form (Vi) + Form2(V2), where o is a real 
number. 


(b) Show that Form;(curl V) = dw, where w = A dx + B dy + C dz. 


6. Using the differential form version of Stokes’ theorem, prove the vector field version in 
Section 8.2. Repeat for Gauss' theorem. 


7. Interpret Theorem 16 in the case k — 1. 
8. Let — (x + y)dz + (y + z) dx + (x +z)dy, and let S be the upper part of the unit 
sphere; that is, S is the set of (x, y, z) with x? + y? + z2? = 1 and z > 0. 05 is the unit circle 


in the xy plane. Evaluate f,. c» both directly and by Stokes’ theorem. 


9. Let T be the triangular solid bounded by the xy plane, the xz plane, the yz plane, and 
the plane 2x + 3y + 6z = 12. Compute 


If F\ dx dy + Fydydz+ Fdzdx 
‘ar 
directly and by Gauss’ theorem, if 


(a) Fi =3y, Fy = 18z, F; = —12; and 
(b+) Fioz h=x?, R=y. 


Review Exercises 605 


10. Evaluate ff, œ where w = z dx dy + x dy dz + y dz dx and S is the unit sphere, directly 
and by Gauss' theorem. 


11. Let R be an elementary region in R°. Show that the volume of R is given by the formula 


1 
v(R) = z [| onn +zdxdy. 


12. In Section 4.2, we saw that the length /(c) of a curve e(t) = (x(t), y(t), z(t), a < t < b, 


was given by the formula 
^ ( ds 
l(c) = [| ds= — 4t 
e- f[a- [ (2) 


where, loosely speaking, (ds)? = (dx) + (dy)? + (dz), that is, 


ds dx\? dy 2 dz\* 
T + T 
dt dt dt dt 
Now suppose a surface S is given in parametrized form by ®(u, v) = (x(u, v), y(u, v), 
z(u, v)), where (u, v) € D. Show that the area of S can be expressed as 


a= ff ds 


where formally (d$) = (dx ^ dy)? + (dy ^ dz} + (dz ^ dx}, a formula requiring 
interpretation. (HINT: 


and similarly for dy and dz. Use the law of forms for the basic 1-forms du and dv. Then dS 
turns out to be a function times the basic 2-form du dv, which we can integrate over D.) 


REVIEW EXERCISES FOR CHAPTER 8 


1. Let F = 2yzi + (—x + 3y + 2) + (x? + z)k. Evaluate Sfs (V x F)* dS, where S is the 
cylinder x? + y? = a?, 0 < z < 1 (without the top and bottom). What if the top and bottom 
are included? 


2. Let W be a region in IR? with boundary 3 W. Prove the identity 


[[exoxor-as= fff e m on - fff rw x V x GdV. 


3. Let F = x?yi + z5j — 2xyzk. Evaluate the integral of F over the surface of the unit cube. 
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4. Verify Green's theorem for the line integral 
| x!ydx + y dy, 
C 


when C is the boundary of the region between the curves y — x and y — x, 0<xz<l. 


5. (a) Show that F = (x? — 2xy°)i — 3x? y?j is a gradient vector field. 
(b) Evaluate the integral of F along the path x = cos? 0, y = sin 0,0 < 0 < 7/2. 


6. Can you derive Green's theorem in the plane from Gauss’ theorem? 
7. (a) Show that F = 6xy(cos z)i + 3x?(cos z)j — 3x?y(sinz)k is conservative (see 
Section 8.3). 
(b) Find f such that F — V f. 
(c) Evaluate the integral of F along the curve x = cos! 0, y = sin’ 0, z = 0, 
0<6<7/2. 


8. Let r(x, y, z) = (x, y, z), r = |ir||. Show that V?(logr) = 1/r? and V?(r") = 
n(n + Wr". 


9. Let the velocity of a fluid be described by F = 6xzi + x?yj + yzk. Compute the rate at 
which fluid is leaving the unit cube. 


10. Let F = x?i + (x?y — 2xy)j — x?zk. Does there exist a G such that F = V x G? 


11. Let a be a constant vector and F = a x r [as usual, r(x, y, z) = (x, y, z)]. Is F 
conservative? If so, find a potential for it. 


12. (For students who have read Section 8.5.) Consider the case of incompressible fluid flow 
with velocity field F and density p. 


(a) If p is constant for each fixed /, then show that p is constant in ¢ as well. 
(b) If p is constant in f, then show that F - Vp = 0. 


13. (a) Let f(x, y, z) = 3xye" . Compute V f. 
(b) Let e(t) = (3 cos? t, sin? t, e'),0 < t x n. Evaluate 


[vres 


(c) Verify directly Stokes’ theorem for gradient vector fields F = V f. 


14. Using Green's theorem, or otherwise, evaluate f, x? dy — y? dx, where C is the unit 
circle (x? + y? = 1). 


15. Evaluate the integral ff. F- dS where F = xi + yj + 3k and where S is the surface of 
the unit sphere x? + y? +z? = 1. 


16. (a) State Stokes’ theorem for surfaces in IR?. 
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(b) Let F be a vector field on IR? satisfying V x F = 0. Use Stokes’ theorem to show 
that f, F- ds = 0 where C is a closed curve. 


17. Use Green's theorem to find the area of the loop of the curve x = a sin 8 cos 0, 
y — asin? 0, fora > 0 and0 « 0 < r. 


18. Evaluate fs yz dx + xz dy + xy dz where C is the curve of intersection of the cylinder 
2 


x? + y? = | and the surface z = y?. 
19. Evaluate f(x + y) dx + (2x — z) dy + (y + z) dz where C is the perimeter of the 
triangle connecting (2, 0, 0), (0, 3, 0), and (0, 0, 6) in that order. 


20. Which of the following are conservative fields on IR?? For those that are, find a function 
f such that F — V f. 


(a) F(x, y, z) = 3x?yi + x?j + 5k 
(b) F(x, y, z) = (x + z) — (y + z)j + (x — y)k 
(c) FQ, y, z) = 2xy?i + x723j + 3x* yz^k 


21. Consider the following two vector fields in IR?: 


(i) F(x, y, z) = yi - zj + x?k 

(ii) G(x, y, z) = 3 — 3xy*)i+ (y? — 3x2y)j + zk 

(a) Which of these fields (if any) are conservative on IR?? (That is, which are gradient 
fields?) Give reasons for your answer. 

(b) Find potential for the fields that are conservative. 

(c) Leto be the path that goes from (0, 0, 0) to (1, 1, 1) by following edges of the cube 
0<x<1,0<y<1,0<z <1 from(0,0, 0) to (0, 0, 1) to (0, 1, 1) to (1, 1, 1). Let B be 
the path from (0, 0, 0) to (1, 1, 1) directly along the diagonal of the cube. Find the values of 


the line integrals 
[r* [9 nz [95 
a a B B 


22. Consider the constant vector field F(x, y, z) = i + 2j — k in R’. 


(a) Find a scalar field (x, y, z) in R? such that Vp = F in IR? and ¢(0, 0, 0) = 0. 
(b) On the sphere © of radius 2 about the origin find all the points at which 

(i) $ is a maximum, and 

(ii) $ is a minimum. 
(c) Compute the maximum and minimum values of ¢ on X. 


23. Let F be a C! vector field and suppose V - F(xo, yo, zo) > 0. Show that for a sufficiently 
small sphere S centered at (xo, yo, zo), the flux of F out of S is positive. 


24. Let B C IR? bea planar region, and let O € IR? be a point. If we connect all points in B to 
O we get a cone, say C, with vertex O and base B. Show that 


volume (C) — ; area (B) h, 


where A is the distance of O from the plane of B, using the following steps. 
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Figure 8.R.1 


l. LetO be the origin of the coordinate system. Define r(x, y, z): = (x, y, z). Evaluate the 
flux of r through the boundary of C, that is, [fyo r+ ndA, where n is the outward unit 
normal to 9C. 

2. Evaluate the divergence of r in C, that is, [ffe V +r av. 

3. Use Gauss' theorem, which states that the total divergence of a vector field within a 
region enclosed by a surface is equal to the flux of that vector field through the surface: 


J[[ vro - ff rn. 


Answers to Odd-Numbered 


Exercises 


Some solutions requiring proofs may be incomplete or be omitted. 
Chapter 1 
Section 1.1 


1. 4:17 


5. 


3. (—104 4- 16a, —24 — 4b, —22 4- 26c) 


7. 


y= 


9. x 20z-0,yeR;x-0,y 


OzeR;y 


0,x,zER;x=0,y,zER 


11. (Qs, 7s +2, 7t) | s e R,t € R} 


15. Kr) = Qt — 1)i — j + Qt — Dk 


13. Kr) 2 —i (t — Dj—k 


17. [si-3sk-25j]0 s 1,0811) 


19. If (x, y, z) lies on the line, then x 


2+ty 


2+ t,andz = —1 + t. Therefore, 


2x —3y+z2-2=4+42t+6-3t 
satisfies both conditions. 


l+t 


21. Yes. 


7, which is not zero. Hence, no (x, y, z) 
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23. The set of vectors of the form 
v=pa+qb+re 
where0 < p<1,0<q<l,andO0<r<1l. 
25. All points of the form 
(xo + Gri — xo) + s(xa — xo), yo + t(yi — yo) + s(ya — yo), zo + t(zi — zo) + s(z2 — zo) 
for real numbers ż and s. 


27. If one vertex is placed at the origin and the two adjacent sides are u and v, the new 
triangle has sides bu, bv, and b(u — v). 


29. (1,0, 1) +(0, 2, 1) = (0, 2, 0) + (1, 0, 2) 


31. Two such lines (there are many others) are x = 1, y = t, z = t and x =1,y=t,z 


Section 1.2 

1. 6 

3. 99° 

5. No, it is 75.7; it would be zero only if the vectors were parallel. 

7. [ull = V5, Ivii = V2, u-v = -3 9. llull = VIT, [lvl] = v62, u-v = —14 
11. lu] = 4/14, llv]] = 26, u-v = —17 


13. In Exercise 9, cos~! (- 14/4/114/62); in Exercise 10, 2/2; and in 11, 
cos"! (-17/4/144/26). 


15. —4(—i + j + K)/3 17. Any (x, y, z) with x + y +z = 0; 
for example, (1, —1, 0) and (0, 1, —1). 


19. i+ 4j, 0 ~ 0.24 radian east of north 21. (a) 12:03 »M. (b) 4.95 km 


25. (4.9, 4.9, 4.9) and (—4.9, —4.9, 4.9) N. 


Fy- 50 sin (50°) Ib 


Fx= 50 cos (50°) Ib 


27. (a) F = BV2i + 34/2j) (b) 0.322 radian or 18.4° (c) 18/2 
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Section 1.3 


121 301 
15 0 12-8 h 2 1-28 

202 202 

3. —3i+j+ 5k 5. v35 
7. 10 9. +k 


11. &(113i + 17j — 103k)//23, 667 


13. u+v=3i—3j+3k;u-v = 6; |jul| = V6; ||v|| = 3;u x v = —3i + 3k 


15. (a x -y -tz— 1-20 (c) 5x + 2z 2 25 
(b) x -2y -32- 6-0 (d) x + 2y — 3z = 13 


17. (a) The parallel planes Ax + By + Cz + D = 0 and oAx + oBy + aCz + D' = 0 are 
identical when D' = oD and otherwise never intersect. 
(b) Ina line. 


19. The line x — t, y = 2t,z = —St. 
21. (a) Do the first by working out each side in coordinates, and then use that and 
a x (b x c) = —(b x c) x a to get the second. 
(b) Use the identities in part (a) to write the quantity in terms of inner products. 
(c) Use the identities in part (a) and collect terms. 
23. Compute the results of Cramer’s rule and check that they satisfy the equation. 
25. x —2y+3z+12=0 27. 4x — 6y — 102 = 14 
29. 10x — I7y tz 25 «0 
31. For Exercise 19, note that (2, —3, 1) - (1, 1, 1) — 0, and so the line and plane are parallel 
and (2, —2, —1) does not lie in the plane. For Exercise 20, the line and plane are parallel and 
(1, —1, 2) does lie in the plane. 
33. 42/13 
35. (a) Show that M satisfies the geometric properties of R x F. (b) 2/3 
37. Show that n; (N x a) and n;(N x b) have the same magnitude and direction. 
39. One method is to write out all terms in the left-hand side and see that the terms involving 


À all cancel. Another method is to first observe that the determinant is linear in each row or 
column and that if any row or column is repeated, the answer is zero. Then 


a bi e a b a a b a a ba 
a + Adi bz 4 Abi Cc);4Àc|-—]|a b; cjXla b ci —]a) b cj. 
a bz C3 a3 by c a3 by c a by c 
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Section 1.4 
1. (a) 
Cylindrical Rectangular Spherical 
r 0 z x y z P 0 h 
1| 45° 1 J2/2 | J2/2 1 42 | 45° 45* 
2 | x2 | 4 0 2 -4 245 | 2/2 | x — arccos (2/5/5) 
0} 45° 10 0 0 10 10 45° 0 
3| z/6 4 343/2 | 3/2 4 5 1/6 arccos 4 
1| 2/6 0 V3/2 i 1 z/6 z/2 
2 | 37/4 | —2 -v2 | v2 |-2 242 | 37/4 37/4 
(b) 
Rectangular Spherical Cylindrical 
x y z p 0 o r 9 z 
2 1 | -2 3 arctan 4 x — arccos (2/3) V5 arctan + -2 
0 3 4 5 z/2 arccos (4/5) 3 2/2 4 
2 1 1 2 | arctan (/2/2) x/3 V3 | arctan (/2/2) 1 
-2/3 | —2 3 5 77/6 arccos H 4 17/6 3 


3. (a) Rotation by x around the z axis 
(b) Reflection across the xy plane 


5. No; (p, 0, $) and (—p, 8 +2, x — $) represent the same point. 


7. (a) e, = (xi + yj +zk)/ yx? + y? +27 = (xi + yj + zk)/p 
eg = (-yi + xj)/ Vx? + y? = (7yi + xj)/r 
ep = (xzi + yzj — (x? + y°)k)/rp 

(b) e x j 2 —yk/V/x? + y?, e; x j = (xz/rp)k + (r/p)i 


9. (a) The length of xi + yj + zk is (x? + y? + z2)? = p 
(b) cos = z/(x? + y? 4- z2)? 


(c) Rotation by 7/2 about the z axis together 
with a radial expansion by a factor of 2 


(c) cos@ = x/(x? + y3)" 


11. 0 <r <a,0 x 0 < 2m means that (r, 6, z) is inside the cylinder with radius a centered 
on the z axis, and |z| < 5 means that it is no more than a distance b from the xy plane. 


13. —d/(6cos¢) < p < d/2,0 < 0 < 2r, and x —cos (1) x6 < x 


15. This is a surface whose cross section with each surface z — c is four-petaled rose. The 
petals shrink to zero as |c| changes from 0 to 1. 
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Section 1.5 
1.3 


3. |x-y| = 10 = 5/20 = IIxllllyll, so |x- y| < Iixililyll is true. 
lx + yl] = 345 = lix] + Ilyll, so lix + yl] < lixll + Iyl] is true. 


5. |x-y| =5 < V65 = [[xllllyll, so x - y < [Ixilllyll is true. 
Ix + yll = 28 < V5 + V13 = IIxll + llyll, so lix + yl] < [xl] + Iyl is true. 


Se E- 
7. AB=|-1 11 3]|.det A= —5, det B = —24, 

-6 5 8 
det AB = 120(— det A det B), det (A + B) = —61(Z det A + det B) 


9. Hint: For k = 2 use the triangle inequality to show that ||xi + x2|| < |x: |] + []x2ll; then 
for k =i + 1 note that ||x, + X2 +--+ + Xll < lxi Xo +--+ xil + xia. 


11. (a) Check n — 1 and n — 2 directly. Then reduce an n x n determinant to a sum of 
(n — 1) x (n — 1) determinants and use induction. 

(b) The argument is similar to that for part (a). Suppose the first row is multiplied by A. 
The first term of the sum will be Aa); times an (n — 1) x (n — 1) determinant with no factors 
of A. The other terms obtained (by expanding across the first row) are similar. 


s 0 1 L0 
13. Not necessarily. Try 4 — [0 | and B = li bl 


15. (a) The sum of two continuous functions and a scalar multiple of a continuous function 
are continuous. 


(b) © (af + Bg)-h = fo (ef + Be)GORG) dx 
= fo foohGo) dx + B fü g(x)h(x) dx 
— af -h4 fg-h. 
Gi) feg = h feos) dx = fy Gf Qo dx =g: f. 
In conditions (iii) and (iv), the integrand is a perfect square. Therefore, the integral in 


nonnegative and can be 0 only if the integrand is 0 everywhere. If f(x) 4 0 for some x, then 
it would be positive in a neighborhood of x by continuity, and the integral would be positive. 


17. Compute the matrix product in both orders and check that you get the identity. 


19. (det A)(det A~!) = det (447!) = det (J) = 1 
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1. v+w = 4i + 3j + 6k; 3v = 9i + 12j + 15k; 6v + 8w = 26i + 16j + 38k; —2v = 
—6i — 8j —10k; v - w = 4; v x w = 9i + 2j — 7k. Your sketch should display v, w, 3v, 6v, 
8w, 6v+ 8w, v - w as the projection of v along w and v x w as a vector perpendicular to both 
v and w. 
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3. (a) Kr) 2 Ci -(Q4- Dj —k ©) —2x +y +22 =9 
(b) Kt) = Gt — 3) + (t - Dj — tk 


5. (a) 0 (b) 5 (c) —10 

7. (a) x/2.— (b) S/GVIS) — (c) -10/(/6/17) 

9. {sta+s(1—t)b|0<+t <1land0<s <1} 

11. Let v = (a1, a5, a3), w = (bi, b2, b3), and apply the CBS inequality. 
13. The area is the absolute value of 


aq a 
bi b 


- a a 
© [by +Aa by + Aag| 


(A multiple of one row of a determinant can be added to another row without changing its 
value.) Your sketch should show two parallelograms with the same base and height. 


15. The cosines of the two parts of the angle are equal, because 
a-y/|lallllvl| = (a -b  llalilibiD/Iivii = b + v/Iibilllvil. 


ijk 
100 
0 10 


15. igis 


19. (a) Hint: The length of the projection of the vector connecting any pair of points, one 
on each line, onto (a; x a2)/||a1 x ag|| is d. 


(b V2 


21. (a) Note that 


| i d 1 0 0 
l X: X; X3| = ! x x: x x: x l pee i, 
* [^i 2 a 1 e 41 334 = . 
2 2]»— = 
p: 3» J yı H= J5 —9i 79 id 
(b) j 
23. Rectangular Spherical (plot omitted) 


(à (V2/2,4/2/2,1)) (a) (V2, 1/4, 77/4) 
(b) (34/3/2,3/2, —4) (b) (5,7/6, arccos (—4/5)) 


(c) (0,0,1) (c) (1,2/4, 0) 
(d) (0,—2, 1) (d) (V5, 37/2, arccos (/5/5)) 
(e) (0,2, 1) (e) (V5, 2/2, arccos (/5/5)) 


25. z =r? cos20;cos¢ = p sin? cos 20 
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27. |x-y| = 6 < v98 = [Ixilllylls lix + yll = v33 < V14 + V7 = Ixli + Iyl 


29. (a) The associative law for matrix multiplication may be checked as follows: 
[(4B)C]; = SAB Cyy = X, » Ai Bi Cy; 
k=l k=l l=1 
= 1400) = [4GOX;. 
is! 


Use this with C taken to be a column vector. 
(b) The matrix for the composition is the product matrix. 


31. R” is spanned by the vectors ei, ej, ..., e,. If v € IR", then 


Tv — Yee] = Yw-enTer. 
i=l izl 


Let a;; = (Te; - e;), so that 


n 
Te; = J ayes. 
j=l 


Then 
n 
Tv-e = jS +; Agi 
i=l 
That is, if 
U uci Gin vi 
v=) 215 then Ty =a 
Un Ani +++ Ann Un 
as desired. 


33. (a) 70cos0 --20sin0 — (b) 214/3 + 6) ft - Ib 


35. Each side equals 2xy — 7yz + 5z? — 48x + 54y — 5z — 96. (Or switch the first two 
columns and then subtract the first row from the second.) 


37. Add the last row to the first and subtract it from the second. 


a dj 45 
39. (a) -ẹbi b; b| — (b) 1/3 
6 € C? G 
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41. Use the fact that ||a||? = a - a, expand both sides, and use the definition of c. 
43. (1/4/38) — (6/4/38)j + (1/V38)k 45. Q/45)i — (1//5)j 
47. (43/2)i + (1/2V2)j + (1/2/2)k 


Chapter 2 
Section 2.1 
1. The level curves and graphs are sketched below. The graph in part (c) is a hyperbolic 


paraboloid like that of Example 4, but rotated 45° and vertically compressed by a factor of 
1/4. To see this, use the variables u = x + y and v = x — y. Then z = (v? — 12)/4. 


z 2 
Intersection 
ı With xy plane 
E \ " 
* Intersection 
*; b 
Suas with xz plane 
Intersection uc 
with zy plane qe 
»- 


Section 


Section 2.1 617 


section x= —y 


section x= y 


(c) z 2 —xy 


3. For Example 2, z = r(cos0 + sin 0) + 2, shape depends on 6; for Example 3, z = r?, 
shape is independent of 0; for Example 4, z = r?(cos? 0 — sin? 0), shape depends on 6. 


5. The level curves are circles x? + y? = 100 — c? when c < 10. The graph is the upper 
hemisphere of x? + y? 4- z? = 100. 


7. The level curves are circles, and the graph is a paraboloid of revolution. See Example 3 
of this section. 


9. If c = 0, the level curve is the straight line y = —x together with the line x = 0. If 
c #0, then y = —x + (c/x). The level curve is a hyperbola with the y axis and the line 
y — —x as asymptotes. The graph is a hyperbolic paraboloid. Sections along the line y — ax 
are the parabolas z = (1 + a)x?. 
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11. Ifc > 0, the level surface f(x, y, z) = c is empty. If c = 0, the level surface is the point 
(0, 0, 0). If c < 0, the level surface is the sphere of radius ./—c centered at (0, 0, 0). A section 
of the graph determined by z = a is given by t = —x? — y? — a?, which is a paraboloid of 
revolution opening down in x yf space. 


13. Ifc < 0, the level surface is empty. If c = 0, the level surface is the z axis. If c > 0, itis 
the right-circular cylinder x? + y? = c of radius /¢ whose axis is the z axis. A section of the 
graph determined by z = a is the paraboloid of revolution £ = x? + y?. A section determined 
by x = b is a “trough” with parabolic cross section f(y, z) = y? + b?. 


15. Setting u = (x — z)/4/2 and v = (x + z)/4/2 gives the u and v axes rotated 45° around 
the y axis from the x and z axes. Because f = vy/2, the level surfaces f = c are 
"cylinders" perpendicular to the vy plane (z — —x) whose cross sections are the hyperbolas 
vy = c/V/2, so the section Sy-a N graph f is the hyperbolic paraboloid t = (z + a)y in yzt 
space [see Exercise 1(c)]. The section S,- N graph f is the plane t = bx + bz in xzt space. 
The section S,., graph f is the hyperbolic paraboloid t = y(x + b) in xyt space. 


17. Ifc « 0, the level curve is empty. If c = 0, the level curve is the x axis. If c > 0, it is the 
pair of parallel lines |y| = c. The sections of graph with x constant are V-shaped curves 
z — |y| in yz space. The graph is shown in the accompanying figure. 


z 


¥ 
(Exercise 17) 
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19. The value of z does not matter, so we get a “cylinder” of elliptic cross section parallel 
to the z axis and intersecting the x y plane in the ellipse 4x? + y? = 16. 


21. The value of x does not matter, so we get a “cylinder” parallel to the x axis of hyberbolic 
cross section intersecting the yz plane in the hyperbola z? — y? — 4. 


23. An elliptic paraboloid with axis along the x axis. 


25. The value of y does not matter, so we get a “cylinder” of parabolic cross section. 
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27. This is a saddle surface similar to that of Example 4, but the hyperbolas, which are level 
curves, no longer have perpendicular asymptotes. 


Level curves 


29. A double cone with axis along the y axis and elliptical cross sections 


3-4x1 + 227 
6= 2x2 =e UT 
” 
x 


31. Complete the square to get (x + 2) + (y — b/2Y + (z + iy = (b? -- Ab 4- 97)/4. This 
is an ellipsoid with center at (—2, 5/2, -2) and axes parallel to the coordinate axes. 
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33. Level curves are described by cos 20 = cr?. If c > 0, then —/4 < 0 < x/Aor 

31/4 < 0 < 51/4. If c < 0, then 2/4 € 0 < 32/4 or 51/4 < 0 < 71/4. In either case 
you get a figure-eight shape, called a /emniscate, through the origin. (Such shapes were first 
studied by Jacques Bernoulli and are sometimes called Bernoulli's lemniscates.) 


Section 2.2 


1. If (xo, yo) € A, then |xo| < 1 and |yo| < 1. Letr = min(1 — |xo|, 1 — |yo|). Prove that 
D, (Xo, yo) C A either analytically or by drawing a figure. 


3. Letr = min2 — yx? + 92, Vxl + ye — V2). 
5.()0 (5-12 (91 

7. (95 (00 (92x 

9.()0  (-1/ (90 


11. (a) Compose f(x, y) = xy with g(t) = (sin t)/t for t # 0 and g(0) = 1. 
(b 0 (c) 0 


13. (a) 1 (b) xoll — (c) (1, e) 
(d) Limit doesn't exist (look at the limits for x — 0 and y — 0 separately). 


15. Use parts (ii) and (iii) of Theorem 4. 
17. (a) Let the value of the function be 1 at (0, 0). (b) No. 


19. For |x — 2| < 8 = Ve 4-4 — 2, we have |x? — 4| = |x — 2||x 4- 2| < à(8 + 4) = £. By 
Theorem 3(iii), limit x? = (limit x)? =2?=4, 


21. Letr = ||x — y||/2. If Iz — yl| < r, let f(z) = |z — yll/r. If |lz—yll > r, let f(z) = 1. 
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23. (a) limit f(x) = L if for every e > Othereisaó > Osuchthatx > bandO0 <x —b <6 


imply | f(x) — L| < e. 
(b) limit (1/x) = —oc, limit e' = 0, and so limit e'/* = 0. Hence 
pers i06 


x07 


limit 1/(1 + e!/*) = 1. The other limit is 0. 
x07 
(c) 


25. Ife > 0 and xo are given, let 5 = (e/ K)'/*. Then || f(x) — f(xo)l| < K8” = £ whenever 
Ix — xol| < à. Notice that the choice of à does not depend on xo. This means that f is 
uniformly continuous. 


27. (a) Choose 8 « 1/500. (b) Choose 5 < 0.002. 
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1. (a) 9f/8x = y; 0f/8y =x 
(b) af/ax = ye" ;8f/8y = xe" 
(c) 8f/üx = cosx cos y — x sin x cos y; 3f /ð3y = —x cos x sin y 
(d) 8f/àx = 2x[1 + log (x? +y’); Af/Ay = 2y[1 + log (x? + y’); Gc y) # (0, 0) 


3. (a) dw/dx = (1 + 2x?) exp (x? + y?); dw/dy = 2xy exp (x? + y?) 
(b) dw/dx = —A4xy*/(x? — yy; üw/ày = 4yx2/(x? — y?) 
(c) 89w/8x = ye” log (x? + y?) + 2xe? /(x? + y’); 
9w/8y = xe” log (x? + y?) + 2ye” /(x? + y?) 
(d) dw/dx = 1/y;dw/dy = —x/y? 
(e) ð3w/ðx = —y e” sin ye” sinx + cos ye” cos x; 
ðw/ðy = (xye + e?)(—sin ye" sinx) 


5. z=6x+3y—11 


10 t Lg 
; ®© [0 | ©) bs. x j 


e xe’ —siny (y+xye y)? 
(b) | 1 0 (d) siny x cosy 
1 e 5y? 10xy 
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9. Atz— 1. 11. Both are xye™”. 


13. (a) Vf = (e 9" (-2x? + 1), -2xye 77? , 2xze n7) 
(b) Vf = (x? + y? - zy? (yzQ? + z! — x?) xz(x? + 2? — y?), xy? + y! — 22) 
(c) Vf = (z^e' cos y, —z?e" sin y, 2ze* cos y) 

15. 2x+6y—z=5 17. —2k 


19. They are constant. Show that the derivative is the zero matrix. 


Section 2.4 


1. This curve is the ellipse (y?/16) + x? = 1: 


ta 


3. This curve is the straight line through (—1, 2, 0) with direction (2, 1, 1): 


5. 61+ 6tj + 37k 7. (—2cost sint, 3 — 377, 1) 
9. c'(t) = (e', —sint) 11. c(t) = (t cost + sint, 4) 


13. Horizontal when t = (R/v)nz, n an integer; speed is zero if n is even; speed is 2v if n 
is odd. 


15. (sin3, cos3, 2) + (3 cos3, —3 sin 3, 5)(t — 1) 


17. (8,8,0) 19. (8,0,1) 
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Section 2.5 
1. Use parts (i), (ii), and (iii) of Theorem 10. The derivative at x is 2( f(x) + 1)Df(x). 
3. (a) A(x, y) = f(x, u(x, y)) = f(p(x), u(x, y)). We use p here solely as notation: 
p(x) 2 x. 


ðh  Ofdp + ðf ðu af 4 af ðu T dp _ dx 


ax  Opdx dudx Op dudx dx dx 


Written out: 


JUSTIFICATION: Call ( p, u) the variables of f. To use the chain rule we must express A as a 
composition of functions; that is, first find g such that h(x, y) = f(g(x, y)). Let g(x, y) = 
(p(x), u(x, y)). Therefore, Dh = (Df)(Dg). Then 


ah ah] [af A] ax ay | f af 
E 3- E] zl- ee =| 
ðx ay 
[9f , 9f du of du 
= au Ox ias) 


ðh Əf = af au ðh af af au " " 
and so dx Us + Bu dx You may see gu + Bu ax oS an answer. This requires 
careful interpretation because of possible ambiguity about the meaning of 3f /ð3x, which is 
why the name p was used. 
©) ðh of  Ofdu  Ofdv dh Of du Of du. Af dw 


ax dx dudx dudx © ðx ðuðx ðvəðx ðw dx 


5. Compute each in two ways; the answers are 
(a) (f o c)'(t) = e' (cost — sint) 
(b) (f o c) (t) = I5r* exp (36) 
(c) (foe) (t) = (e —e* [I + log (e” +e] 
(d) (foey(t) = (1 c 4?) exp (217) 
7. Use Theorem 10(iii) and replace matrices by vectors. 


9. (fog), y) = (tan (> — 1) — e, e0) — (x — y) and 
pyoatn=[> 2] 
11. i cos (1) cos (log 42) 
13. —2cost sinte?"' + sin‘ t + cos? te! — 3 cos? t sin? t for both (a) and (b). 


15. (2,0) 
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ðG  0Gd 
17. (à) G(x, y) 20 adso Z422 Lg, 
ax dy dx 
dy, 8G, 9Gi] ['8G, 
dx | dy 8y) Ox ET n : 
(b) dis E 0G, 9G, 3G; where ^! means the inverse matrix. 
dx ðyı Oy2 Ox 


The first component of this equation reads 


9G, dG, | 9G; 8G, 
dy Ox dy2 Ox y, 
dx 96,96; 9G2 9G, 


dy. Oy2 — 8yi yo 


dy —2x 


© dx 3y re 


19. Apply the chain rule to 8G/8T where G(t(T, P), p(T, P), V(T, P)) = 
P(V — b)e"/**T — RT is identically 0; (T, P) = T; and p(T, P) = P. 


21. Define Ri(h) = f(xo + h) — (xo) — [Df (xo)]h. 
23. Let gı and g be C' functions from IR? to IR such that g\(x) = 1 for ||x|| < 42/3; 


g1(x) = 0 for |xl| > 22/3; go(x) = 1 for IIx — (1, 1,0)I| < /2/3; and go(x) = 0 for 
Ix — (1, 1, Ol| > 24/2/3. (See Exercise 22.) Let 


1 0 0] fm 1 0 0 -I] [x 
A(x)-|0 -1 0/|x | |1 and  h()-|0 0 0O0||x|, 
0 0 0|lx 0 00 1] |x 


and put f(x) = gi(x)ii(x) + go(x)ho(x). 


25. Proof of rule (iii) follows: 


Vix) — (xo) — Lf (xo)Dg(xo) + g(xo)D/(xo)Y(x — Xo)| 
lix — xoll 


MEN IgG) — g(xo) — Dg(xo)(x — xo)l 


lix — xoll 
n |f) — f (xo) — Df(xo)(x — xo) 
lix — xoll 
1/69 — f(xo)l Ig) — g(xo)l lix — xol. 
lix — xoll lix — xoll 


As X — Xo, the first two terms go to 0 by the differentiability of f and g. The third does so 
because | f(x) — f(xo)|/llx — xol| and |g(x) — g(xo)l/llx — xol| are bounded by a constant, 
say M, on some ball D, (xo). To see this, choose r small enough that [ f(x) — /(xo)]/ 

lx — xoll is within 1 of Df(xo(x — xo)/llx — xoll if |x — xoll < r. Then we have 
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IFC) — f(xo)l/Ilx — xoll < 1 + IDf(xo(x — xo)l/lix — xoll = 1 + IVf(xo) - (x — xo)l/ 
lx — xoll < 1+ ||Vf(xo)|| by the Cauchy-Schwarz inequality. 

The proof of rule (iv) follows from rule (iii) and the special case of the quotient rule, 
with f identically 1; that is, D(1/g)(xo) = [—1/g(xo)*]Dg(xo). To obtain this answer, note 
that on some small ball D, (xo), g(x) > m > 0. Use the triangle inequality and the Schwarz 
inequality to show that 


ES = ES p za Dro — Xo) 
Ix — xoll 
1 1 |g(x) — g(xo) — Dg(xoY(x — xo)l 
^ le] lgo) lix — xoll 


lg(x) — g(x0)| [Dg(xo)(x — xo)l 
lgG)lg(xoy lix — xoll 


1 |g(x) — g(xo) — Dg(xo)x — xo) IVg(xo)Il 
om IIx — xol mi 80) = go). 


These last two terms both go to 0, because g is differentiable and continuous. 


27. First find formula for (8/x)(F(x, x)), using the chain rule. Let F(x, z) = fj f(z, y)dy 
and use the fundamental theorem of calculus. 


29. By Exercise 26 and Theorem 1 0(iii) (Exercise 25), each component of k is differentiable 
and D&;(xo) = /(xo)Dg;(xo) + g;(xo)Df (xo). Because [Dg;(xo)]y is the ith component of 
[Dg(xo)]y and [D/(xo)]y is the number Vf (xo) - y, we get [Dk(xo)]y = f(Xo)[Dg(xo)]y + 
[D (xo)ly[g(xo)] = fo) [Dg (xo)ly + [V (xo) - y]g(%o). 


Section 2.6 
1. Vf(1, 1, 2)* v = (4,3, 4) -(1/4/5, 2/4/5, 0) 24/5 
3. (a) 17e°/13 — (b e/V3 (0 


5. (a) z-9x = 6y — 6 (b zy =x/2 (c)z=1 
7. (a) -730+ +9 (b) 2i + 2j 4- 2k (c) —4(i+j+k) 


9. k 


11. The graph of f is the level surface 0 = F(x, y, z) = f(x, y) — z. Therefore, the tangent 
plane is given by 


0 = VF (xo, Yo, Zo) (X — X0, Y — Yo» Z — zo) 


= (Lov, ii 2) * (x — x0, y — Yo. Z — zo). 
x oy 
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Because zo = f (xo, yo), this is z = f (xo, yo) + (8f/8x)(xo. yo)((x — xo) + 
(8f/3y xo. YoY — yo). 


13. (a) Vf =( + y,z + x,x + y), g(t) = (e, — sint, cost), 
(f ogJ(1) = 2ecos 1 + cos? 1 — sin? 1 
(b) Vf = (yze"", xze"" , xye”), g (t) = (6, 60,312), (f o g)(1) = 108e!* 
(c) Vf — [1 + logG? +y? +.2*)I@i+ yj + 2k), g = (e, —e™, 1), 
(f ogJ(1) = [1 + log(e? + e? + ye? — e? + 1) 


15. Let f(x, y, z) = 1/r = (x? + y? +2?) "?;r = (x, y, z). Then we calculate 
Vf = —( y! T zy? (x, y, z) = -(/rPr. 


17. Vf = (g'(x), 0). 
19. Df(0,0,...,0) = [0, ..., 0] 


21. d, = [—(0.03 + 25y,)/2a]i + yıj, do = [-(0.03 + 25y;)/2a]i + yzj, where yı and y; 


2 
are the solutions of (a? + 52)y? + 0.03by + (e EN a’) =ð. 


À x+x x—xoY. 1 DX 
23. VV — 2; — 
Ire [( Ho is (s 3)1 


25. Crosses at (2, 2, 0), 4/5/10 seconds later. 


Review Exercises for Chapter 2 
1. (a) Elliptic paraboloid. 
(b) Let y' = y +3 and write z = xy’. This is a (shifted) hyperbolic paraboloid. 
2xy x? 
-ye -—xe^ 


3. (a) vre» =| al (c) Df@y.z))=[e e e] 


: 100 
(b) »e-[] (d) Df, y, 2) — |0 1 0 
001 


5. The plane tangent to a sphere at (xo, yo, zo) is normal to the line from the center to 
(Xo; Yo, zo). 
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7. (a) z=x—y+2 (d) 10x + 6y — 4z 2 6— x 
(b)z—4x—8y—8 —(e)2z— 2x - J2y 
(6) e+ y+2=-1 (f) x-F-2y 2-2 


9. (a) The level curves are hyperbolas xy = 1/c: 


11. (a) 0 (b) Limit does not exist. 


13. (1 + 2x2) exp(1 +x? +y?) 
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15. (a) The line L(t) = (xo, yo, f (xo. yo)) + t(a, b, c) lies in the plane z = f'(xo, yo) if 
c = 0 and is perpendicular to Vf (xo, yo) if a(8f/8x)(xo, yo) + £(8b/8y)(xo, yo) = 0. On L, 
we have 


I (Xo yo) + ES »]e xo) + ES »]e yo) 
x oy 


= f (x0, yo) + a| Len »)] + nes »] 
x oy 
= f(X0, yo) 2 z 


Therefore, L lies in the tangent plane. An upward unit normal to the tangent plane is 
p = (1 + IVS I 7 (-(3f/8x)Go. yo), — (8f /y)(o. yo), 1). Therefore, cosó = p -k = 
(1+ IV IP)", and tan6 = sin6/cos6 = (IV I?/( + IV£I) 7/0 + VAP)? = 
|| V.£|| as claimed. 

(b) The tangent plane contains the horizontal line through (1, 0, 2) perpendicular to 
Vf(1, 0) = (5, 0), that is, parallel to the y axis. It makes an angle of arctan (|| V/(1, 0)||) = 
arctan 5 ~ 78.7° with respect to the xy plane. 


17. (1/42, 1/42) or (-1/4/2, —1/4/2) 


19. A unit normal is (4/2/10)(3, 5, 4). The tangent plane is 3x + 5y + 4z = 18. 
21. 4i 4- 16j 


23. (a) Because g is the composition A +> Ax > f(Ax), the chain rule gives 


x1 
g'(A) = DfQx)| : 
Xn. 
Thus, 
xi 
£() - DfQ)| : | 2 VfG9-x 
Xn. 


But also g(A) = 4” f(x), so g/(A) = pa?! f(x) and g'(1) = pf (x). 
(b) p=1. 


25. Differentiate directly using the chain rule, or use Exercise 23(a) with p = 0. 


27. (a) If (x, y) # (0, 0), then one calculates for (i) that Af/8x = (y? — yx?)/(x? + y?yY 
and 8f/8y = (x? — xy?)/(x? + y?Y.. If x = y = 0, use the definition directly to find that both 
partial derivatives are 0. For (ii), if (x, y) # (0, 0), then 3f/ð3x = 2xy$/(x? + y4)? and 

af/day = Qx*y — 2x?y°)/(x? + y*). The partials at the origin are zero. 
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(b) The function (i) is not continuous at (0, 0); the function (ii) is differentiable, but the 
derivative is not continuous. 


29. (a) /21/8 — (b) —sin/2 — (c) 224/2e7? 


31. (—4e!, 0) 
33. (a) See Theorem 11. 
(b) g(u) = (sin 3u)? + cos 8u Vf = (2x, 1) 
g (u) = 6sin 3u cos 3u — 8 sin 8u Vf(h(0)) = V f(0, 1) = (0, 1) 
g(0) —0 h'(u) = (3 cos 3u, —8 sin 8u) 


g'(0) = Vf(h(0)) - h'(0) = (0, 1) (3,0) = 0 
35. t = //14(—3 + V359)/70 = (—3 + /359)/5A/14 


37. 8z/üx = 4(e? - P14 y)/(e?*-?» — ey? 
Bza = 4er? x) Yle — ey 


39. Notice that y — x?, so that if y is constant, x cannot be a variable. 
Al. [F'OLA + f()g (Ol exp L/()8(0)] 
43. d[f(c(t))]/dt = 2t/[(1 + t? + 2cos? t2 — 26? + 1*)] 
—4t(t? — 1) In(1 + t? --2cos? t)/(2 — 2? + r*Y 
—4costsint/[(1 + t? + 2cos? t)(2 — 21? + 1*)] 


45. Let x = f(t), y = t, and use the chain rule to differentiate u(x, y) with respect to t. 


47. (a) n= PV/RT; P =nRT/V;T = PV/nR;V =nRT/P. 
(b) 8V/8T =nR/P;8T/AP = V/nR;8P/8V = —nRT/V?. Multiply, remembering 
that PV =nRT. 


49. (a) One can solve for any of the variables in terms of the other two. 
(b) 3T/3P = (V — B)/R; 
aP/aV = —RT/(V — BY. + 2a/ V?; 
aV/aT = R/V — BXRT/( — BY — 2a V3] 
(c) Multiply and cancel factors. 


51. (a) (1/42, 1/4/2) 


(b) The directional derivative is 0 in the direction 


(xoi + yoj)/V xà + ¥0- 


(c) The level curve through (xo, yo) must be tangent to the line through (0, 0) and 
(Xo, yo). The level curves are lines or half-lines emanating from the origin. 


53. G(x, y) 2x —y 
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Chapter 3 
Section 3.1 
ï ar xy-xy af =x’ y + xy? 
" 8x? (x? Ly ay? (x2 m i 
Pf f — —6x* + 36x2y? — 6y4 
üxüy — Aydx (x? + yy 
2 2 
3. of = —y* cos(xy?), a = —2x sin (xy?) — 4x?y? cos (xy?), 
a a : 
E = pis = —2ysin (xy?) — 2xy! cos (xy?) 
s 9f cos? x -F e)cos2x --2sin? 2x 
"ax? (cos? x +e) : 
Pf ^ e” —cos’x 
8y?  e"(cos!x +e} 
of of 2sin2x 


ðxðy dydx  e"(cosx e»? 


7. (a) 9?z/üx? = 6, dz/ay? = 4, (b) 2?z/8x? = 0, 0?z/8y? = Ax /3y?, 
O?z/üx dy = 0?z/üy ax = 0 8z/dx ay = 0?z/0y üx = —2/3y? 


9. fey = 2x + 2y, fy = 22, fix — 0, feyz = 0 


11. Because f and 3f/ðz are both of class C?, we have 


of 9? af 9 af afl E afar\ af 
üx8yüz Axdy Oz  ðyðxðz  ðy\Əðxəðz] Əy \ az ax dy Oz Ox" 


13. fa, = fewx = €% [2xy cos (xw) + x?y?z cos (xw) — x? yw sin (xw)] 


af x xy 

15. — = arctan — + ———— 

(a) Jx arci i" Tay 
af — —x? 
ðy x+y 
rf 2y? f 2x?y 
üx? (x? nm ? ay? (x? nm 
rf of —2xy? 


ðxðy  Oyóx (x? +y? 
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Of | —xsinyx?c- y? 0f —ysin yx? + y? 


ur VZ+» cy Vt + y 
9f ^ x sinyx? +y? x’cosyx? +y? _ sinyx? +y? 
3x2 nom x2 + y2 (2 + yay 
Bf y? sin yx? + y? B y? cos yx? + y? _ siny? c y? 
ay? (x? + yp x2 +y? (x? + y? 
rr of siny/x?--y? cos /x? + y? 
üxüy day dx ii (x? + yy? x? + y? 
8 9, 
€) S.L ase — 9, S. aye! — 9, 
ax ay 
of 2 2 »* af 2 2 2 
pet AO -23ep(-r y) i7 (4y* — 2)exp(—x* — y), 
Pf Pf 


= mit ? RT 
axdy dydax ay ADI =y 


- af (dx\ " a dxdy Wf dy ex af dy 
` ax? \ dt üx8y dt dt 3y? V dt ax dt? ` ay dt?’ 


where e(t) = (x(t), y(t)) 
19. Evaluate the derivatives 0?u/8x? and 0?u/8y? and add. 


21. (a) Evaluate the derivatives and compare. 


(b) ó 


23. V =—GmM/r = —GmM(x? + y? + z?)*!?, Check that 
V Oy sv 


+- = GmM y -£yp-3ao?y 4-277 $y 427) ']=0 


ax? t ay? ðz? 


m 
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Section 3.2 


1. f(h,h;)- hî 2hiho +h} [R2(0, h) = 0 in this case] 


hi hi 
3. fh h) 23h + hy +> + hile +> + R(0, h) 


5. f(hi, h2) = 1 + hyh2 + R2(0, h) 


7. (a) Show that |R;(x, a)| < 4B*+*'/(k + 1)! for constants A, B, and x in a fixed interval 
[a, b]. Prove that Ry — Oas k — oo. (Use convergence of the series 5 c/k! = ef and use 
Taylor’s theorem.) 

(b) The only possible trouble is at x = 0. Use L'Hópital's rule to show that 


limit p(t)e' = oo 
1-00 


E 


for every polynomial p(t). Using this, establish that limit p(x)e = 0 for every rational 
x30 


function p(x), and conclude that f(0) = 0 for every k. 
(c) f:R" > Ris analytic at xo if the series 


9? 
fsa) + Y Lati ;àw reas eo 


ak 
4 P» hihi => hugs ge 0) 


ll 
converges to (xo + h) for all h = (hı, ..., An) in some sufficiently small disk ||h|| < £. The 


function f is analytic if for every R > 0 there is a constant M such that |(0* f/àx;, - - + Əxi) 
(x)| < M* for each kth-order derivative at every x satisfying ||x|| < R. 


dx JUN ce op 
(d) foo ene o eno! e (ere 
= 


Section 3.3 
1. (0, 0); saddle point. 


3. The critical points are on the line y = —x; they are local minima, because f(x, y) = 
(x + yy. = 0, equaling zero only when x = —y. 


5. (0, 0); saddle point. 
7. (—}, —4); local minimum. 
9. (0, 0); local maximum. (The tests fail, but use the fact that cos z < 1.) 


(172, /7/2), local minimum 
(0, Jt), local minimum. 
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11. No critical points. 13. (1, 1) is a local minimum. 
15. (0, nz); critical points, no local maxima or minima. 

17. Minimum at (0, 0) and maxima at (0, +1) [and saddles at (+1, 0)]. 

19. (a) af/dx and 8f/8y vanish at (0, 0). 


(b) Show that f(g(t)) = 0 at t = 0 and that f(g(r)) = 0 if |t| < |b|/3a?. 
(c) f is negative on the parabola y = 2x?. 


21. The critical points are on the line y — x and they are local minima (see Exercise 3). 
23. Minimize S = 2xy + 2yz + 2xz with z = V/xy, V the constant volume. 
25. 40,40, 40 
27. The only critical point is (0, 0, 0). It is a minimum, because 
2 2 


1 
fayz) T +2 xy-lG eg xD 


29. (1, 3) is a saddle point; (5, 34) is a local minimum. 
31. 3 is the absolute maximum and 0 is the absolute minimum. 
33. —2 is the absolute minimum; 2 is the absolute maximum. 
35. G, 4) is a local minimum. 
37. If u,(x, y) = u(x, y) + (1/n)e*, then Vu, = (1/n)e* > 0. Thus, u, is strictly 
subharmonic and can have its maximum only on 8 D, say, at p, = (Xn, Yn). If (xo, yo) € D, 
check that this implies u(x,, Yn) > u(xo, yo) — e/n. Thus, there must be a point q = (Xo, Yoo) 
on à D such that arbitrarily close to q we can find an (x, Yn) for n as large as we like. 
Conclude from the continuity of u that u(x, Yoo) = u(xo, yo). 
39. Follow the methods of Exercise 37. 
41. (a) If there were an x; with f(xi) < (xo), then the maximum of f on the interval 
between xo and x would be another critical point. 
(b) Verify (i) by the second derivative test; for (ii), f goes to —co as y — oo and 

x=-y. 
Section 3.4 

1. Maximum at V3a. —], 1), minimum at Jic 1, -1) 


3. Maximum at (4/3, 0), minimum at (—4/3, 0) 
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5. Maximum at (Fe E.) minimum at Ce - 


7. The minimum value 4 is attained at (0, 2). Use a geometric picture rather than Lagrange 
multipliers. 


9. (0, 0, 2) is a minimum of f. 
11. H is the absolute maximum and 0 is the absolute minimum. 
13. The diameter should equal the height, 20/3/20 cm. 
15. Maximum value 4/3 at Gs. F -*) and minimum value —4/3 at Ch E 3 
17. Horizontal length is y/q A/p, vertical length is /pA/q. 


19. For Exercise 1, the bordered Hessians required are 


- 0 2x 2y 
Ha = |2x -24 0|-28AQ?4 ?, 
2y 0 -2. 


0 2x 2y 2z 
2x -2 0 0 
2y 0 -2 0 
2z 0 0 -2 


CES = —16A(x? + y! +27). 


At NEU —1, 1) the Lagrange multiplier is à = 4/6/4 > 0, indicating a maximum at 


3a, —1, 1), anda = —/6/4 < 0 indicates a minimum at Jc 1, 1, —1). In Exercise 5, 


|H| = 24X (4x? + 6y?), and so A = 4/70/12 > 0 indicates a maximum at (9/4/70, 4/4/70) 
and à = —4//70/12 < 0 indicates a minimum at (—9/4/70, —4/4/70). 


21. 11,664 in? 
23. (a) Vf(x) — Ax. 
(b) S is defined by the constraint function g(x) = x? + x} + x? — 1. Because 
Vg(x) = 2x is not 0, Theorem 9 applies. At an x where f is extreme, there is a 4/2 such that 
Vf(x) = (4/2)/Vg(x). That is, Ax = Ax. 


25. Minimum is (—1/4/2, 0) maximum is G, 3-4/7/8), local minimum at (1/4/2, 0). 


27. No critical points; no maximum or minimum 
29. (—1,0, 1) 


31. The point (K, L) = (@B/q, (1 — a)B/p) optimizes the profit. 
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Section 3.5 


1. Use the special implicit function theorem with n — 1. (See Example 1.) Line (i) is given 
by 0 = (x — xo, y — yo) + VF (Xo, yo) = (x — xo(0 F /0x)(xo, yo) + W — yoX8 F/8y)(xo. yo). 
For line (ii), Theorem 11 gives dy/dx = —(0F/dx)/(0F/dy), and so the lines agree and are 
given by 
(8.F/x xo, yo) 


-OFIN 79 


Y=» 


3. (a) Ifx « -h we can solve for y in terms of x using the quadratic formula. 
(b) 3F/ðy = 2y + 1 is nonzero for (y | y < -1) and (y | > — $}. These regions 
correspond to the upper and lower halves of a horizontal parabola with vertex at (— ; 5 -} 
and to the choice of sign in the quadratic formula. The derivative dy/dx = —3/(2y + 1) is 


negative on the top half of the parabola, positive on the bottom. 


5. Let F(x, y, z) = x82? — 23 yx; ð F/ðz = 2x?z — 3z*yx Æ Oat (1, 1, 1). Near the origin, 
with x = y Æ 0, we get solutions z = 0 and z = x, and so there is no unique solution. At 
(1, 1), 8z/üx = 2 and dz/dy = —1. 


7. With F, = y +x + uv and F, = uxy + v, the determinant in the general implicit 
function theorem is 
ƏFı/ðu 8Füv| —— — 
8F,/8u 8F]0v| -" "*» 
which is 0 at (0, 0, 0, 0). Thus, the implicit function theorem does not apply. If we try directly, 
we find that v = —uxy, so x + y = u?xy. For a particular choice of (x, y) near (0, 0), either 
there are no solutions for (u, v) or else there are two. 


9. No. f(x, y) = (—1, 0) has infinitely many solutions, namely, (x, y) = (0, y) for any y. 


11. (a) xg + yè #0. 
(b) f'(z) 2 —6 +2y/Q?+y)s g'G) = zy — 2x)/@? + y?). 


13. Multiply and equate coefficients to get ag, a1, and a» as functions of r1, r2, and r3. Then 
compute the Jacobian determinant 8(ao, a), a2)/8(ri, r2, r3) = (rs — rari — r2) — r3). 
This is not zero if the roots are distinct. Thus, the inverse function theorem shows that the 
roots may be found as functions of the coefficients in some neighborhood of any point at 
which the roots are distinct. That is, if the roots rj, r2, r3 of x? + a2x? + aix + ao are all 
different, then there are neighborhoods V of (ri, r2, r3) and W of (ao, a, a2) such that the 
roots in V are smooth functions of the coefficients in W. 
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1. (a) Saddle point. 
(b) Saddle point for any C. 


3. (a) 1 (b) 83/6 
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5. Use the second derivative test; (0, 0) is a local maximum; 
(—1, 0) is a saddle point; (2, 0) is a local minumum. 


7. Saddle points at (n, 0), n = integer. 
9. Maximum 7 2.618, minimum ~ 0.382. 
11. Maximum 1, minimum cos 1 
13. z= 1/4 
15. (0,0, +1) 
17. If b > 2, the minimum distance is 24/b — 1; if b < 2, the minimum distance is |b]. 
19. Not stable. 
21. f(-3, —V3/2) = 34/3/4 
23. x = (20/3)4/3; y = 104/3;z = 5/3 


25. The determinant required in the general implicit function theorem is not zero, and so we 
can solve for u and v; (8u/8x)(2, —1) = 13/32. 


27. Anew orthonormal basis may be found with respect to which the quadratic form given 
by the matrix 
bd 
k E 
b e 


takes diagonal form. This change of basis defines new variables and n, which are linear 
functions of x and y. Manipulations of linear algebra and the chain rule show that 

Lv = A(0?v/9£?) + u(3?v/3n?). The numbers A and y. are the eigenvalues of A and are 
positive, because the quadratic form is positive-definite. At a maximum, ðv/ð = dv/dn = 
0. Moreover, 0?v/8£? < 0 and 3?v/3n? < 0, because if either were greater than 0, the cross 
section of the graph in that direction would have a minimum. Then Lv < 0, thus 
contradicting strict subharmonicity. 


29. Reverse the inequalities in Exercises 27 and 28. 

31. The equations for a critical point, ds /8m = ðs /ðb = 0, when solved for m and b give 
m = (yi — y2)/ 6a — x2) and b = (yoxi — y1X2)/(%1 — x2). The line y = mx + b then goes 
through (xi, y1) and (x2, y2). 

33. Ata minimum of s, we have 0 = ðs /3b = —2» 7 (y; — mx; — b). 


35. y=ġx+É 
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Section 4.1 


1. r'(r) = —(sint)i + 2(cos27)j, r'(0) = 2j, a(t) = —(cost)i — 4(sin21)j, a(0) = —i, 
Kt) =i+ 2rj 


3. r'(t) = Vit e'j — e™'k, r'(0) = /2i + j — k, a(t) = e'j + e™'k, a(0) = j +k, Kr) = 
4Dti - (1 -- £j 4- (1 — Dk 


5. (e! — e^, cost — sint, —3t?) 


7. [-3f^(2sint + cost) — P (2 cost — sint)]i + BP RE +e) + Pe! — e~) 
+ [e'(cos t — sint) — e™(— sint + cost)]k 


9. m(0, 6, 0) 
11. —24z  (cos(2t/5), sin (27t /5))/25 


d d dv 
13. © (ivi?) = T) = 2v swa 


15. 6129 seconds 


P p 
17. c(t) = e dx 1) 


19. (a) e(t) = (t, e'), —oo < t < co. The image of this path is the graph y = e*. 
(b) e(t) = (3 cost, sint), 0 < t < 27r, an ellipse. 
(c) e(t) = (at, bt, ct) 
(d) e(t) = (3 cos t, } sint), 0 < £ < 2x, an ellipse. 


21. ¢(t) x e'(r) is normal to the plane of the orbit at time /. As in Exercise 20, its derivative is 
0, and so the orbital plane is constant. 


Section 4.2 
1. 2/52 3. 2042-1) 
6-43 1 2/2 +3 
5. —— + = log | ——— 7. 2n(4 5 4- V2 
A EE zx (4/5 + 4/2) 
9. 3+ log2 


11. (a) Because o is strictly increasing, it maps [a, b] one-to-one onto [o(a), a(b)]. By 
definition, v is the image of c if and only if there is a ¢ in [a, b] with c(t) = v. There is one 
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point s in [o(a), a(b)] with s = a(t), so d(s) = e(t) = v. Therefore, the image of c is 
contained in that of d. Use o7! similarly for the opposite inclusion. 


(b) a(b) s=a(b) 
la T id'G)ll a - f lid (o(0)lo (o) dt 


(a) s=a(a) 


I 


t=b b 
P Woo tar = f lie (li dt = le. 


(c) Differentiate d using the chain rule. 


13. (a) =f? le(s) ds = f? ds 2b—a 

(b) T(s) = e'(s)/lle'(s)I| = e'(s), so T'(s) = e" (s). Then k = ||T'|] = lle" s)Il. 

(c) Show that if v and w are in R?, ||v x wl] = lw — (v * w/Ilvl)vll-I|v]]. Use this to 
show that if p(t) = (x(t), y(t), z(t) is never (0, 0, 0) and f(t) = p(1)/||p(t)]|, then 


dt foll loc)? di WOW 


ao d [yo POPO o| ad E He) x PON 
With p(t) = c'(t), this gives 


aO OO, sou eO x eI 
ES — ————— d T LL. 
ier op O 7) TOIS "ugs 


T() 


If s is the arc length of c, ds/dt = ||e’(t)||, and therefore 


dT dT ds r 
Fl- di = K|e'(t)ll- 
Thus, 
1 dT [ex ÀO 
leol dt ler” 


(This result is useful in Exercise 15.) 


(d) 1/v2 


15. (a) Because c is parametrized by arc length, T(s) = e'(s), and N(s) = e”(s)/|le"(s)Il- 
Use Exercise 13 to show that 


d f, &\,, (m oem, 
€ x €x = € 
ds lel) - le^] lle"? 
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and 


dB (e xe").e — (e xe)-e" 
ds lie"l? lle" I? 


(b) Obtain T’(t) and ||T'(ż)|| as in Exercise 13. B is a unit vector in the direction of 
e x T' = (e' x e”)/|le’|], so B = (e' x e”)/|le’ x e"||. Use the solution of Exercise 13 with 
p — €' x e" to obtain 


dB/dt = (e x €^)/lle' x e” — (Ie x e”) (e x e")/Ile x Pe x e^), 
and the values of T' and ||T'|| to get 
N = (lle ll/Ile x e'lle" — (e x ele I. 
Finally, use the chain rule and the inner product of these to obtain 


dB (e x e): e" 
|ds/dt| dt le x el ^ 


dB 
t= -|F wo] “N(s(t)) 
s 


() 42/2 


17. (a) N is defined as T'/||T'||, so T' = ||T'||N = KN. Because T- T' = 0, T, N, and B are 

an orthonormal basis for IR?. Differentiating B(s) - B(s) = 1 and B(s)- T(s) = 0 shows that 

B' -B = 0 and B'- T + B. T' = 0. But T'- B = ||T’||N- B = 0, so B'- T = O also. Thus, 

B = (B'- T)T + (B'- N)N + (B'- B)B = (B'- N)N = —zN. Also, N’- N = 0, because 

N-N = I. Thus, N' = (N'- T)T + (N'- B)B. But differentiating N - T = 0 and N - B = 0 

gives N'- T = —N- T' = —k and N'- B= —N- B' = q, and so the middle equation follows. 
(b) w — tT + kB 


19. Follow the hint in the text. 


Section 4.3 


1. 
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5. F = (2y,x): 7. 


9. The flow lines are concentric circles: 


11. The flow lines for t > 0: 


13. c(t) = Qe, 1/1, 1/2) = F(e(t)) 


15. c'(t) = (cost, — sint, e) = F(c(t)) 
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17. Compare im for the escape velocity v, = /2g Ro and the velocity in an orbit of radius 
Ro given in Section 4.1. (Ignore the rotation of the earth.) 


19. Use the fact that — VT is perpendicular to the surface T — constant. 


Section 4.4 
1. ye” — xe” + ye” 3. 3 


5. div V > 0 in the first and third quadrants, 
div V < 0 in the second and fourth quadrants. 


7. V -F = 0; if F represents a fluid, there is neither expansion nor compression; the area of 
a small rectangle remains the same. 


y 
F: 

9. 3x? — x? cos (xy) 11. ycos(xy) + x? sin(x?y) 
13. 0 
15. (10y — 8z)i + (6z — 10x)j + (8x — 6y)k 
17. —sinx 19. x 
21. Vx Vf =0 23. Vx Vf — 0 
25. Vx F Z0 


27. Let F = Fii 4- Fj + Fk and compute both sides of the identity. 


29. (a) 2xyi + x?j (c) (—y?zx3, 2x? y4z, 2x? z? — 2xy) 
(b) (3y?xz, 4xz — y?z, 0) (d) 4 yz? + x? 
31. No. 


33. Separate each expression into its real and imaginary parts and then treat the 
resulting quantity as a vector field on R?. Directly calculate its curl and divergence. 


Review Exercises for Chapter 4 


In (a), F = (x? — y?)i — 2xyj; in (b), F = (x? — 3xy?)i + 9? — 3x? y)j; and in (c), 


F— 


(e cos y)i — (e sin y)j. Show that V - F = 0 and V x F = 0 in each case. 


Review Exercises for Chapter 4 


1. 


11. 


13. 


15. 


17. 


21. 


23. 


25. 


27. 


29. 


31. 


33. 


v(1) = (3, —e7!, —/2); a(1) = (6, e~", 0); 
s(1) = /9-- e? + Xt) = (2, €, 0) + (t — DG, —e7!, 7/2) 


v(0) = (1, 1, 0); a(0) = (1,0, —1);s = 4/2; I(t) = (1,0, 1) + ((1, 1,0) 


Tangent vector: v = —(1//2)i+ (1/4/2)j + k 

Acceleration vector: a = —(1/4/2)(i + j) 
4 

m(2, 0, —1) 9. | V 14 $C? + 1795 dt 
1 

(a) v = (—2t sin(t?), 2t cos(t?), 0); s = 2t 

1 43 
© (s 79) 
(c) J5x/3 


(d) v = 24/51/3(4/3/2, 1/2, 0; s = 24/513 
3. Sm 
e (1 8)/ 


i t 2 t 
SS hPa Peay pees 


Compute c'(t) and check that it equals F(c()). 

9:0 19. 3;-i-j—k 

0;-i—j—E 

Vf = (ye? — ysinxy, xe” — x sin xy, 0); verify that V x Vf = 0 in this case. 
Vf = Qxe + y! sinxy?, 2xy sinxy?, 0); check that V x Vf = 0 from this. 


(a) (yz?, xz^,2xyz; (b) (z — y, 0, —x) 
(c) Qxy2z) — 3xy?z?, 2x? y?z — yz}, y?z? — 2x? yz?) 


div F = 0; curl F = (0, 0, 2(x? + y?) f(x? + y?) + 2 f(x? 4- y?) 


(a) A cone about i’ making an angle of 2/3 with ï. 
(b) Vg = (3x?, 5z, Sy + 2z) 


(a) [3 P/3x} + (8P/8yy]? 
(b) A small packet of air would obey F — ma. 
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(c) (d) 


N s C ds 

Wind direction S Wind direction 
(x 
© G 
E 


R? + p? 2(R? + p?yz 
a op nado M e a 
p gp 


W 


37. 680 miles per hour 


Section 5.1 
1. (a) B (b zi (c) 1 (d) log2 — 2 


3. To show that the volumes of the two cylinders are equal, show that their area functions 
are equal. 


5. 2r*(tan0)/3 7. 9 
9. (2/1 (e? 4- 1) n. 5 
Section 5.2 


L()i (e-2  (Oisml  (d2m4-2 
3. 1/4 


5. Use Fubini's theorem to write 


[f@)gQ)] dx dy = re " fede dy, 
R c a 


and notice that f, i f(x) dx is a constant and so may be pulled out. 


Section 5.3 645 


7. 11/6 


9. Because f dy = hk 2y dy = 1, we have hU. f(x, y) dy] dx = 1. In any partition of 
R = [0, 1] x [0, 1], each rectangle Rj, contains points e with x rational and e with x 
irrational. If in the regular partition of order n we choose ¢ jx = e in those rectangles with 

1 


O<y< i andcj, = e when y > 5, the approximating sums are the same as those for 


2y 


Because g is integrable, the approximating sums must converge to fp gdA = 7/8. However, 
if we had picked all c;; = e all approximating sums would have the value 1. 


11. Fubini's theorem does not apply because the integrand is not continuous nor bounded at 
(0, 0). 


Section 5.3 


1. (a) l/3, both (b) 5/2, both (c) (e — 1)/4, both (d) 1/35, both 


3 A=f", Ma dydx =2 f”, Jr? — x? dx = r?[arcsin | — arcsin(—1)] = z7?. 


5. 28,000 ft? 7. 0 
9. y-simple; 7/2 11. 507 
13. 7/24 


15. Compute the integral with respect to y first. Split that into integrals over [—ó(x), 0] and 
[0, $(x)] and change variables in the first integral, or use symmetry. 


17. Let (R;;) be a partition of a rectangle R containing D and let f be 1 on D. Thus, f* is 1 
on D and 0 on RAD. Let cj, € R\D if Rj; is not wholly contained in D. The approximating 
Riemann sum is the sum of the areas of those rectangles of the partition that are contained 

in D. 

Section 5.4 


1. (a) 1/8 (b) z/4 — (c) 17/12 
(d) G(b) — G(a), wheredG/dy = F(y, y) — F(a, y)anddF/dx = f(x, y) 


3. Note that the maximum value of f on D is e and the minimum value of f on D is 1/e. 
Use the ideas in the proof of Theorem 4 to show that 


1 1 
isaf fœ. y)dA <e. 


646 


Answers to Odd-Numbered Exercises 


5. The smallest value of f(x, y) = 1/(x? + y? + 1) on D is 4, at (1, 2), and so 
1 
i f(x, y)dx dy > -area D =1. 
D 6 
The largest value is 1, at (0, 0), and so 


| f(x, y)dx dy € l-area D = 6. 
D 


9. x (204/10 — 52)/3 


4 
7. zzabc 


1. 43/4 


13. D looks like a slice of pie. 
1 x 42 A/2-x? 
f LU rem] LC rm mo [as. 
o LJo 1 0 
15. Use the chain rule and the fundamental theorem of calculus. 


Section 5.5 


1. 1/3 3. 10 


5. ve+y<z</x?+y2,-f/l-y <x< J/l-y,-l<y<l 


7. 0<z<VJ1-x—y,-J/l—-y <x<VJl-y,-l<y<l 


9. 50x/4/6 M. 1/2 
13. 0 15. a5/20 
17. 0 19. 3/10 
21. 1/6 


23. i LY eco v dz dyds 


2s. f. FT 77 rey adsdyas 


27. ff, f P azaxdy = ff, f(x, y)dx dy 


29. Let M, and m, be the maximum and minimum of f on Be. Then we have the inequality 
m, vol(Be) < ff f;, FAV < M. vol (B.). Divide by vol (B.), let e > 0 and use continuity of f. 


Review Exercises for Chapter 5 


Review Exercises for Chapter 5 


1. 81/2 


5. 81/2 


9. 7/60 


13. In the notation of Figure 5.3.1, 


1. 


If dx dy= [ens 


15. (a)0 (z/24 (90 


17. y-simple; 2z + 2? 


19. x-simple; 73/3 


21. y-simple; 33/140 
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23. y-simple; 71/420. 


25. 1/3 27. 19/3 


29. 7/12 


31. The function f(x, y) = x? + y? + 1 lies between 1 and 2? + 1 = 5 on D, and so the 
integral lies between these values times 4zr, the area of D. 


33. Interchange the order of integration (the reader should draw a sketch in the (u, t) plane); 


Pr FG)dudt — f [ FGodt du — [6 — FG). 


37. The region is the shaded region W in the figure. 


35. 1/12 


Chapter 6 


Section 6.1 649 


The integral in the order dy dx dz, for example, is 


1 pl pl 
Í f f(x, y, z) dy dx dz. 
0 Jz Jinx 


Section 6.1 
1. S =the unit disk minus its center. 
3. D — [0, 3] x [0, 1]; yes 


5. The image is the triangle with vertices (0, 0), (0, 1), and (1, 1). T is not one-to-one, but 
becomes so if we eliminate the portion x* — 0. 


7. Dis the set of (x, y, z) with x? + y? + z? < 1 (the unit ball). T is not one-to-one, but 
is one-to-one on (0, 1] x (0, x) x (0, 27]. 


9. Showing that T is onto is equivalent in the 2 x 2 case to showing that the system 
ax + by =e, cx + dy = f can always be solved for x and y, where 


a b 
A= " 
e d 
When you do this by elimination or by Cramer’s rule, the quantity by which you must divide 
is det A. Thus, if det A Z 0, the equations can always be solved. 
11. Because det A # 0, T maps IR? one-to-one onto R?. Let 77! be the inverse 


transformation. Show that 7~' has matrix 47! and det (47!) = 1/det A, where det A # 0. 
By Exercise 10, P* = T~'(P) is a parallelogram. 


Section 6.2 
1. x(e — 1) 
3. Disthe region 0 <x < 4, įx+3<y<łx+6. (a) 140 (b) —42 


5. D* is the region 0 < u < 1,0 < v < 2; 2(9 — 24/2 — 34/3). 


7 6 64x 
. JT 5 
5 
1. 32/2 13. Ze- 1) 


15. 2a’ 17. $(e-2) 
2 e 


650 


Answers to Odd-Numbered Exercises 


19. 1007/3 21. 2n[V3 — 2 log (1 + V3) + log /2] 
23. 4n log(a/b) 25. 2n|(b^ + Ie?” =(a? + De] 


27. 24 (use the change of variables x = 3u — v + 1, y = 3u + v). 
4 4 
29. (a) zrebe (b) gabe 


31. (a) Check that if T(ui, vi) = T(u», v2), then u, = u» and vi = v2. 
(b) 160/3 


4 
33. 5°? ffr ly, (av?) uv] du dv 


Section 6.3 


1. [x? — sin(x?)]/z? 


11 65 
$: (5) 
5. $503.64 


7. (a) 6, where à is the (constant) mass density. (b) 37/12 


11. 1/4 
k n pas 
13. Letting ô be density, the moment of inertia is f f^ f^ (o sin? ) dp d0 dd. 


15. (1.00 x 105) 


17. (a) The only plane of symmetry for the body of an automobile is the one dividing the 
left and right sides of the car. 

(b) Z. Ww 5(x, y, z) dx dy dz is the z coordinate of the center of mass times the mass 
of W. Rearrangement of the formula for Z gives the first line of the equation. The next step is 
justified by the additivity property of integrals. By symmetry, we can replace z by —z and 
integrate in the region above the xy plane. Finally, we can factor the minus sign outside the 
second integral, and because ó(x, y, z) = 5(u, v, —w), we are subtracting the second integral 
from itself. Thus, the answer is 0. 

(c) In part (b), we showed that z times the mass of W is 0. Because the mass must be 
positive, Z must be 0. 

(d) By part (c), the center of mass must lie in both planes. 


19. V = —(4.71 x 109)Gm/R ~% —(3.04 x 10°)m/R, where m is the mass of a test 
particle at distance R from the planet’s center. 


Section 6.4 651 


Section 6.4 
1.4 3. 3/16 
5. (a) 3x (b) 4«1 


7. Integration of ff e™™ dx dy with respect to x first and then y gives log 2. Reversing the 
order gives the integral on the left side of the equality stated in the exercise. 


9. Integrate over [e, 1] x [e, 1] and let € — 0 to show the improper integral exists and 
equals 2 log 2. 


2 
11. a ahy — g$ L1] 


13. Use the fact that 
sin? (x — y) - 1 
JÁi-xy yl- y 


15. Use the fact that eè? f(x — y) = 1/(x — y) on the given region. 


17. Each integral equals 1/4, and Theorem 3 (Fubini's theorem) does apply. 


Review Exercises for Chapter 6 


LerG=¢ DO SG) 
(b) Sfp fŒ y) dx dy — 4 ff, fQu + v, 2v) du dv 


3. 3 (Use the change of variables u — x? — y?, v — xy.) 
1 
5. qnaa - 1 7. (51/2415 


9. abc/6 

11. Cut with the planes x + y +z = Jkjn, 1<k <n-—1,k an integer. 
13. (25 + 10/5)x/3 
15. (e — e7!)/4 (Use the change of variables u = y — x, v = y + x.) 
17. (9.92 x 105)z grams 


19. (a) 32 
(b) This occurs at the point of the unit sphere x? + y? + z? = 1 inscribed in the cube. 


21. (0,0, 3a^/5) 


23. 4m In(a/b) 
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25. n/2 

27. (a) 9/2 (b) 647 

29. Work the integral with respect to y first on the region D.; = ((x, y)le < x x L, 

0 < y < x} to obtain /,; = Sou fdxdy= k x^? (1 — e™) dx. The integrand is 
positive, and so J,,, increases as £ > 0 and L — oo. Bound | — e~* above by x for 

0 <x «land by | for 1 < x < co to see that /,,, remains bounded and so must converge. 
The improper integral does exist. 


31. (à) 1/6 — (b) 167/3 


33. 2n 


Section 7.1 
1 
1. f fe. yds = f, SOO, xO. (Ile (I dr = f, 0- 14r — 0. 
3. (a) 2 (b) 52/14 
1 2532 4 lo 
5. —z( + 1/2? + - 9/7) 
3 3 
7. (a) The path follows the straight line from (0, 0) to (1, 1) and back to (0, 0) in the xy 


plane. Over the path, the graph of f is a straight line from (0, 0, 0) to (1, 1, 1). The integral is 
the area of the resulting triangle covered twice and equals 4/2. 


28 zx 
o- [2 tî) when —1<1<0 


b 
©) 42(1--1*) when O<t<l. 


The path is 


(y= (i —s//2)(1, 1) when 0O<s<vV2 
~ iG/V2—1)0,) when V2 <5 <2V2 


and f. fds = J/2. 

9. 2a/a 
11. (a) [2V5 + log(2+V5)1/4 — (b) (54/5 — 1)/[6V5 + 3 log(2 + V5)] 
13. The path is a unit circle centered at (0, 0, 0) in the plane x + y +z = 0 and so may be 
parametrized by c(0) = (cos @)v + (sin @)w, where v and w are orthogonal unit vectors laying 
in that plane. For example, v = (1/4/2)(—1, 0, 1) and w = (1/4/6)(1, —2, 1) will do. The 


total mass is 27/3 grams. 


15. Choosing either c(t) = (t?, 1, 0) or e(t) = (1, t?, 0), 0 < t < 1 will do. 


Section 7.2 653 


Section 7.2 
1. (3/2 (0 (@0  (d147 
3. 9 


5. By the Cauchy-Schwarz inequality, |F(e(t)) - e’(t)| < ||F(c(£))l| |c'(£)]| for every t. Thus, 


[Fas 


b b 
[ rew «i < | reos 


a 


b b 
<f MECHI Ie (1 dt < mf lle'()|| dt = MI. 


7. 5 — (n — D/(n +) 


oO BIW 


9. 
11. The length of c. 


13. If c'(t) is never 0, then the unit vector T(t) = c'(r)/||e'(t)|| is a continuous function of t 
and so is a smoothly turning tangent to the curve. The answer is no. 


15. 7 


v I. 
17. Use the fact that F is a gradient to show that the work done is ER independent of 
2 1 
the path. 


19. (a) UoI 
(b) f has a positive derivative; it is one-to-one and onto [0, L] by the mean-value and 
intermediate-value theorems. It has a differentiable inverse by the inverse function theorem. 
(c) gs) = 1/lle’(x)|| where s = f(x). 
(d) By the chain rule, b'(s) = c'(x) - g'(s), which has unit length by part (c). 
Section 7.3 
1. z=2(y—1)+1 
3. 182 1) - 4(y +2) (x — 13) =0 or 1827 —4y -x — 132 0. 


5. The vector n — (cos v sinu, sin v sinu, cosu) — (x, y, z). The surface is the unit sphere 
centered at the origin. 


7. n = —(sin v)i — (cos v)k; the surface is a cylinder. 
9. (a) x = xo + (y — yo(8h/8y)Cyo, zo) + (z — zo)(8h/8z)( yo, zo) describes the plane 


tangent to x = /i( y, z) at (xo, Yo, zo), Xo = (yo, zo). 
(b) y = yo + (x — xoY(8K/8x (xo, zo) + (z — zo)(8k/0z)(xo, zo). 
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11. z—6x — 8y 4-320 


13. (a) The surfaces is a helicoid. It looks like a spiral ramp winding around the z axis. (See 
Figure 7.4.2.) It winds twice around, since 0 goes up to 4r. 

(b) n= &(1/4A1 4- r?)(sin6, — cos6, r). 

(c) yox — xoy + (xg + Y9)z = (x3 + Y9)zo- 

(d) If (xo, yo, zo) = (ro, COS 6o, ro sin 6, 6), then representing the line segment in the 
form ((r cos 6o, r sin 8o, 69)|0 < r < 1) shows that the line lies in the surface. Representing 
the line as ((xo, tyo, Zo)|0 < t  1/(x + y2)) and substituting into the results of part (c) 
shows that it lies in the tangent plane at (xo, yo, zo). 


15. (a) Using cylindrical coordinates leads to the parametrization 


B(z, 0) = (V25 + z? cos 0, V 25 + z? sin6, z), —oo«z«oo, 0x0 «2x 


as one possible solution. 

(b) n = (V25 + z? cos6, 4/25 + z? sin 0, —2)/4/25 + 2z?. 

(c) xox + yoy = 25. 

(d) Substitute the coordinates along these lines into the defining equation of the surface 
and the result of part (c). 


17. (a) ub u, v e v, u e i), and v > v? all map R onto R. 

(b) T, x T, = (0, 0, 1) for $, and this is never 0. For the surface 4», T, x T, = 
9u?v?(0, 0, 1), and this is 0 along the u and v axes. 

(c) We want to show that any two parametrizations of a surface that are smooth near a 
point will give the same tangent plane there. Thus, suppose ®: D C IR? — IR? and 
V: B C R? — R? are parametrized surfaces such that 


SP(uo, vo) = (xo. Yo. zo) = W(so, fo) 0) 
and 


#0 ad — (TY xT?) 


b © u 
(r: x T?) - #0, (ii) 
so that ® and W are smooth and one-to-one in neighborhoods of (uo, vo) and (so, to), which 
we may as well assume are D and B. Suppose further that they “describe the same surface,” 
that is, (D) = (B). To see that they give the same tangent plane at (xo, yo, zo), show 
that they have parallel normal vectors. To do this, show that there is an open set C with 
(uo, vo) € C C D and a differentiable map f: C — B such that (u, v) = P( f(u, v)) for 
(u, v) € C. Once you have done this, computation shows that the normal vectors are related 
by T? x T? = [9(s, r)/8(u, v)]T7 x T7. 

To see that there is such an f, notice that since T? x T} ¥ 0, at least one of the 2 x 2 
determinants in the cross product is not zero. Assume, for example, that 


(50.10) 


ax dy 
ðs ðs 

0. 
ox ay # 
Ot ðt 


Now use the inverse function theorem to write (s, t) as a differentiable function of (x, y) in 
some neighborhood of (xo, yo). 
(d) No. 
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Section 7.4 


1. 4x 3. ETT ES log (1 + V2)] 


5; zr (6v6 —8) 


7. The integral for the volume converges, whereas that for the area diverges. 


9. A(E) = j^ al J " Jab? sin? cos? + bc? sin’ cos? 0 + a2c? sin? $ sin? 0 do dO 


11. (z/6)(5V/5 — 1) 
13. (x/2)/6 
15. 44/5; for fixed 6, (x, y, z) moves along the horizontal line segment y — 2x,z — 0 from 


the z axis out to a radius of 4/5| cos 0| into quadrant | if cos@ > 0 and into quadrant 3 if 
cosé <0. 


17. (x 4-2)/(1 — 2) 
19. z(a 4- b)/1 + m*(b — a) 


21. av —8V2+1) 


23. With f(x, y) = y R? — x? — y?, (4) becomes 


x? + y? 
A(S’) = > + 1 dx d; 
(5) If qure xdy 


R 
=f ff p-gcy F 


where D is the disk of radius R. Evaluate using polar coordinates, noting it is improper at the 
boundary, to get 27 R2. 
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Section 7.5 
1. 3425 3. za? 
24 
5/5 1 
5. (a) Vin RR? (b) 20 R? y Of» T 
4V 3 "D 
9. 16x R3/3 


11. (a) The sphere looks the same from all three axes, so these three integrals should be the 
same quantity with different labels on the axes. 

(b) 4x R*/3 

(c) 4x R*/3 
13. (R/2, R/2, R/2) 
15. (a) Directly compute the vector cross product T,, x T, and then calculate its length and 
compare your answer to the left-hand side. 

(b) In this case, F = 0, so A(s) = ffp VEG du dv. 

(c) 4xa?. 
17. Leta = 8x /8u, b = dy/du, c = 0x/dv, and d = dy/dv. The conditions (a) and (b) in 
Exercise 16 are then a? + b? = c? + d? and ac + bd = 0. Show that a # 0 and, by a 
normalization argument, show that you can assume a = 1. Now calculate further. 


19. 2a? 


Section 7.6 
1. +487 (the sign depends on orientation) 
3. 4n 
5. 2z (or —27,, if you choose a different orientation) 
7. 2n 
9. 127/5 


11. With the usual spherical coordinate parametrization, T; x T; = — sing r (see 
Example 1). Thus, 


[ [r5 ff E-E, x 0404 = f [a-smo20d0 


2n pm 
= [ ji F, sinġ dọ dé 
o Jo 


Section 7.7 


and 


f [r5 [C [ rosso. 


13. Fora cylinder of radius R = 1 and normal component F,, 


b Qn 
[ [a= | f F, d dz. 
s a 0 


15. 27/3 


17. žaber 


Section 7.7 


1. Apply formula (3) of this section and simplify; H = 0 and K = —b? /(u? + P?y.. 


3. Apply formula (3) of this section and simplify. 


—4a®b® 
5. K = ———_. | 
(a^b* + 4b4u? + 4a^v? 
7. Apply formula (3) of this section and simplify. 
9. Apply formula (2) of this section and simplify; K = —A"/[(1 + (h'Y)?A]. 
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1. (a) 3V2(1 — e)/13 (c) (236, 1584/26 — 8)/35 - (25) 
(b) —x 42/2 (d) 842/189 


3. (22/141 (b) -1/2 
5. 2a3 


7. (a) A sphere of radius 5 centered at (2, 3, 0); (6, 5) = (2+ 5cos0 sing, 
3+45sin6 sing, 5cos¢);0 < 0 «21;0xó x x 
(b) An ellipsoid with center at (2, 0, 0); (6, ¢) = (2+ (1/4/23 cos sing, 
3sin@ sing, 3 cos); 0 <6 < 27,0 < <x 
1 
(c) An elliptic hyperboloid of one sheet; ®(@, z) = (378 + 2z? cos@, 


1 
VEFA sint, s): 0 < 0 < 2n,—00 <z co 
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l Qu " : Ex 
9. A($)— 2 fe 4/3 cos? 0 + 5d6; describes the upper nappe of a cone with elliptical 


horizontal cross sections. 


11. 114/3/6 
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Answers to Odd-Numbered Exercises 


13. 42/3 
15. 54/5/6 


17. (a) (e? coszzz, xe" cosmz, —7 xe" sin 7z) (b) 0 


1 
19. AG +1) 


21. n 2 (1//5(-1,0,2,2z x = 1 
23. 0 


25. If F = V, then V x F = 0 (at least if ¢ is of class C?; see Theorem 1, Section 3.4). 
Theorem 3 of Section 7.2 shows that ie V6 - ds = 0 because c is a closed curve. 


27. (a) 24m (b) 24x (c) 60x 


29. (a) [V R* + p(zo —z)l/P (b) VV 2z0(R? + p?)/p?g 


Section 8.1 
1. -8 


3. (a) 0 (b) =x R? (c) 0 (d) =x R? 


5. 3a? 7. 37/2 
9. 37(b? — a?)/2 11. (a) Both sides are 27r. (b) 0 
13. 0 15. mab 


17. A horizontal line segment divides the region into three regions of which Green's theorem 


applies; now use Exercise 8 or the technique in Figure 8.1.5. 

19. 9x/8 

21. Ife > 0, there is aô > 0 such that |u(q) — u(p)| < € whenever ||p — q|| = p < ô. 
Parametrize à B,(p) by q(0) = p + p(cos6, sin 6). Then 


2n 
IZ(p) — 2zw(p)l «f lu(q(8)) — u(p)| d6 < 2ze. 
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23. Parametrize 0 B, (p) as in Exercise 21. If p = ( pi, p2), then /(p) = Jet u(p + p cos, 
pa + p sin6) d6. Differentiation under the integral sign gives 


I 2n 22 it b] 1 
a-f Vu (eos6, sind) a6 = | vu-ndo =~ f ESI VudA 
dp 0 0 p Jap, On PJ JB, 


(the last equality uses Exercise 22). 


25. Using Exercise 24, 


R pn 
I udA =f f u[p + p(cos 8, sin@)]p d8 dp 
Br o Jo 


R R 
= m n 
EE f (L, u ise - [ 2zpu(p) dp = z R°u(p). 


27. Suppose u is subharmonic. We establish the assertions corresponding to Exercises 26(a) 

and (b). The argument for superharmonic functions is similar, with inequalities reversed. 
Suppose V?u > 0 and u(p) > u(q) for all q in Ba(p). By Exercise 23, /'(p) 7 0 for 

0 < p x R, and so Exercise 24 shows that 27u(p) < [(p) x I(R) for0 < p < R. If 

u(q) < u(p) for some q = p + p(cos 6o, sin 6o) € Br(p), then, by continuity, there is an arc 

[6 — 8, @ + 8] on 8 B, (p) where u < u(p) — d for some d > 0. This would mean that 


2m 
2zu(p) < I(p) = =f u[p + p(cos@, sin@)]p dé 
0 


< (2x — 28)u(p) + 28[u(p) — d] < 2xu(p) — 28d. 


This contradiction shows that we must have u(q) = u(p) for every q in Bs(p). 

If the maximum at p is absolute for D, the last paragraph shows that u(x) = u(p) for all 
x in some disk around p. If c: [0, 1) — D is a path from p to q, then u(c(t)) = u(p) for all t in 
some interval [0, b). Let bo be the largest b € [0, 1] such that u(e(t)) = u(p) for all t € [0, b). 
(Strictly speaking, this requires the notion of the least upper bound from a good calculus text.) 
Because u is continuous, u(c(5o)) = u(p). If bo 1, then the last paragraph would apply at 
c(bo) and u is constantly equal to u(p) on a disk around c(5;). In particular, there is a ô > 0 
such that u(e(t)) = u(c(bo)) = u(p) on [0, bo + 5). This contradicts the maximality of bo, so 
we must have bo = 1. That is, e(q) = c(p). Because q was an arbitrary point in D, u is 
constant on D. 


29. Assume V?u, = 0 and V?u; = 0 are two solutions. Let @ = u, — u2. Then V?$ = 0 and 
(x) = 0 for all x € 3D. Consider the integral ff, $V^$ d 4 = — ffp V$ - Vb d A. Thus, 
[fp V$ - V6 d A = 0, which implies that Vp = 0, and so ¢ is a constant function and hence 
must be identically zero. 
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Section 8.2 

1. —2x 

3. Each integral in Stokes’ theorem is zero. 

5.0 7. —4n/J3 
9. 0 11. +27 


13. Using Faraday’s law, ff.[V x E + 3H/8r] dS = 0 for any surface S. If the integrand 
were a nonzero vector at some point, then by continuity the integral over some small disk 
centered at that point and lying perpendicular to that vector would be nonzero. 


15. The orientations of dS, = dS, must agree. 
17. Suppose C is a closed loop on the surface drawn so that it divides the surface into two 
pieces S; and 55. For the surface of a doughnut (torus) you must use two closed loops; can 


you see why? Then C bounds both S, and 55, but with positive orientation with respect to one 
and negative with respect to the other. Therefore, 


[[vsv-«- [[ vxF-ds+ ff vxF-ds= [ F-ds— f F-ds=0. 
s 5 $ [c c 


19. (a) IfC = 8S, f. v- ds = ff. (V x v): d$ = ff,0-ds — 0. 

(b) fov-ds= yd v-e'(t)dt =v- fe. e'(t)dt = v - (c(b) — c(a)), where e: [a, b] > R? 
is a parametrization of C. (The vector integral is the vector whose components are the 
integrals of the component functions.) If C is closed, the last expression is 0. 

21. Both integrals give 7/4. 23. (a) 0 (b) x (c) x 


25. —20z (or 207 if the opposite orientation is used) 


27. One possible answer: The Móbius curve C is also the boundary of an oriented surface S 
the equation in Faraday's law is valid for this new surface. 


Section 8.3 


1. If F = Vf = Vg and C is a curve from v to w, then (f — g)(w) — (f — g)(v) = 
Ic Vf — g)-ds = 0 and so f — g is constant. 


3. x?yz — cosx +C 
5. Yes, it is the gradient of g(x, y) = F(x) + F(y), where F'(x) = f(x). 


1 1 
7. No; V x F = (0,0, —x) £0. 9. esinlt e - 5 


11. 3.5 x 10? ergs 
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1 
13. (a) f=x?/24+y?/2+C () f= 3" +xy?+C 
(b) F is not a gradient field. 


15. Use Theorem 7 in each case. 


(a) —3/2 (b) —1 (c) cos(e?) — cos(1/e)/e 


1 1 1 
17. (a) No. (b) Ge. xy—z, vy) or (zz — 2xyz — 3^ E55 0) 


1 
19. 365 Tx) yk) 
21. (—zsin y + y sinx, xz cos y, 0) (Other answers are possible.) 


23. (a) Vx F —(0,0,2) Z0 

(b) Let c(r) be the path of an object in the fluid. Then F(c(7)) = c'(t). Let 
c(t) = (x(t), y(t), z(t). Then x’ = —y, y' = x, and z' = 0, and so z is constant and the 
motion is parallel to the xy plane. Also, x" +x = 0, y" + y = 0. Thus, x = A cost + B sint 
and y = C cost + Dsint. Substituting these values in x’ = —y, y' = x, we get 
C = —B, D = A, so that x? + y? = A? + B? and we have a circle. 

(c) Counterclockwise 


GmM " 

25. (a) = TG yt r Ara 
ike xy? t2?—-3x? xy Hazy x? +y? +2? — 32? 
` RN (G2 3 y? zn (x2 + y? + zr (x2 + y? +22)972 


=0 


(b) Let S be the unit sphere, 5; the upper hemisphere, Sz the lower hemisphere, and C 
the unit circle. If F = V x G, then 


f [r7 ff o ff ras- [0m [6-0 


But ff, F -dS = —GmM ff.r/Mrl?): ndS = —4z GmM, because ||r|| = 1 and r = n on 
S. Thus, F — V x G is impossible. This does not contradict Theorem 8 because F is not 
smooth at the origin. 


Section 8.4 

1. 4x 

3. 3 

5. (a0 (54/15 (c) —4/15 


T6 
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9.1 
11. Apply the divergence theorem to fF using V - (f F) = Vf -F + fV-F. 
13. If F = r/r?, then V- F = 1/r?. If (0, 0, 0) ¢ Q, the result follows from Gauss’ 


theorem. If (0, 0, 0) € Q, we compute the integral by deleting a small ball 
B, = {(x, y, Z)I(x? + y? +27)? < €} around the origin and then letting £ > 0: 


Tf zav = tini f f za - lini ff rds 
or £0 ABe T £0 AAB) P 
= ang ( [Pas ff Sas) nm fS ase) 
£0 9B. £0 ap r 
l 


The integral over ðB, is obtained from Theorem 10 (Gauss' law), because r = € everywhere 
on B,. 


15. Use the vector identity for div( f F) and the divergence theorem for part (a). Use the 
vector identity V -(f Vg — gV f) = f V?g — gV? f for part (b). 


17. (2) IFØ) = [fy e(@)/(4zllp — all) d V (q), then 


Vo(p) = I" p Xay/An vU Mp = ql) 4V(q) 


- f f | oare — a/p- aiaro. 


where Vp means the gradient with respect to the coordinates of p and the integral is the vector 
whose components are the three component integrals. If p varies in V U dV and n is the 
outward unit normal to dV, we can take the inner product using these components and collect 
the pieces as 


pq 1 
Vé(p)-n- — J[[ 4 A pc qp - 9 ndr). 


Thus, 


Vé(p)-ndV(p) = &Q _} _ _q).ndq) aro). 
I, “ih, Ith, 4x |lp— all 


There are essentially five variables of integration here, three placing q in W and two placing p 
on dV. Use Fubini's theorem to obtain 


[[, om - f[[[ 92] [[, Sota |n 
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If V is a symmetric elementary region, Theorem 10 says that the inner integral is 4z if q € V 


and 0 if q ¢ V. Thus, 
J| vo as - [ [[, omar. 


Because p = 0 outside W, 


f| venas - - [[[ roro. 


If V is not a symmetric elementary region, subdivide it into a sum of such regions. The 
equation holds on each piece, and, upon adding them together, the boundary integrals along 
appropriately oriented interior boundaries cancel, leaving the desired result. 

(b) By Theorem 9, ffy V -dS = fff, V^$ dV, and so fff, Vay = 
— fff, p d V. Because both p and V’¢ are continuous and this holds for arbitrarily small 
regions, we must have V7¢ = — p. 


19. If the charge Q is spread evenly over the sphere S of radius R centered at the origin, the 
density of charge per unit area must be Q/47 R?. If p is a point not on S and q € S, then the 
contribution to the electric field at p due to charge near q is directed along the vector p — q. 
Because the charge is evenly distributed, the tangential component of this contribution will be 
canceled by that from a symmetric point on the other side of the sphere at the same distance 
from p. (Draw the picture.) The total resulting field must be radial. Because S looks the same 
from any point at a distance ||p|| from the origin, the field must depend only on radius and be 
of the form E = f(r)r. 
If we look at the sphere X of radius ||p||, we have 


(charge inside £) = [fe-as= ff f(iplr-nds 


= f(lipli)lipll area © = 4 lip]? F(p- 
If ||p|| < R, there is no charge inside 2; if ||p|| > A, the charge inside X is Q, and so 


I OQ . 
E) = ax pe? if |pl >R 


0 if |pl| < R. 


21. By Theorem 10, ffy F« dS = 4 for any surface enclosing the origin. But if F were the 
curl of some field, then the integral over such a closed surface would have to be 0. 


23. If S 2 QW, then ff, r-ndS = fff, V-rdV = fff, 3dV — 3 volume (JW). For the 
geometric explanation, assume (0, 0, 0) € W and consider the skew cone with its vertex at 


1 
(0, 0, 0) with base AS and altitude ||r||. Its volume is zS +n). 
Section 8.5 


1. Write the components of as £(x, f), n(x, t), and ¢(x, t). First, observe that by definition 
of o, 


Zoa, D = FO, 1) 0). 
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The determinant J can be differentiated by recalling that the determinant of a matrix is 
multilinear in the columns (or rows). Thus, holding x fixed, 


8 8E an at ae 200p c ae an OO 
ðt dx dx Ox Ox dtdx ax Ox dx dt dx 
ð z _| 99€ dn OF), dE Gdn a|, |ə dn A aE 
ar atdy dy ay dy dtdy ay dy dy ðtðy 
ð ð dn Or ə 8 0p OC aE dn O0 
ðt əz Oz dz dz Ot0z dz dz Oz <dtdz 
Now write 
3906 9005 ə 
Bon ^ eps ag ODD 
9 0E ð 8E ð 
Ta eneh st), t), 
ot dy ðyəðt ay 2000.9, D. 
aa aa a 
araz T az ar T az OADD. 


The components F;, F>, and F; of F in this expression are functions of x, y, and z through 
g(x, t); therefore, 

dF ðk | AF, An , OF, ðt 

dE Ox | On ðx dE Ax’ 


a, Pers 0.0) 
X 


0F,0E  0F,0p  0Fji0C 
ð Oz ðn Oz Ot Oz. 


a; Pie. D, D 
z 


When these are substituted into the previous expression for 3J /3t, one gets for the respective 
terms 


ari. OR. OF; 
ty 4 J+ Sy = (div PJ. 
Ox oy Oz 


3. Hints: By the transport equation from Theorem 12, with V in place of F, 


d Dp f 
afff erwe- fff, (2 pd v dcdyas 


Now use the fact that 


Dp 


à 
= +pdiv V =div3+ 2, 


where J = pV, as in the text. 


Section 8.5 665 


5. If v; is the ith component of a vector v, then by the transport equation (Exercise 2), 


|a J[[ menn] - ff [UPrdcdyde= 7 f f| rrasava: 
- fff, [527 + (GE) dv F] dx dy de 


= ii [0+ D,(fF;)-F + (fF;) div r] dx dy dz 
W, 


- fl [ro + vum + aye F] ax dyads 
W 


z JIT. [5,000 «muni + (fF) div F} } dx dyaz 


llf, E + D(/F)F 4 aive] dx dy dz 


-[//f, UP HDPE + (JF) div Edi dy de] 


UIT, (irm e- V)(fF) + (fF) div r) dxdyaz]. 


7. (a) Because V = V¢, V x V = 0, and therefore (V - V)V = ;Vüvib, Euler's 


equation becomes 


Vp dV | r d$ dos. 
-Æ = — + viv) = v(— + cvi J. 
a ata (Ivi (en I 


If c is a path from P to P», then 


dp 1 
-H 4p--[; Vp-e(t)dt = [v(B+pie)-eoa 


— (46, Lyp 
= (7 «awe 


(b) nmi = 0 and p is constant, then 4 V(|V|?) = —(Vp)/p = —V(p/p), and 
therefore V (iive + vip) = 


P 


9. By Ampére’s law, V -J = V-(V x H) — V-(8E/8t) = —V - (dE/dt) = 
—(8/8t)(V - E). By Gauss’ law this is —dp/dt. Thus, V - J + dp/dt = 
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Section 8.6 
1. (a) Qxy? — yx?) dx dy (d) (xy 4- x?) dx dy dz 
(b) (x? + y?) dx dy (e) dx dy dz 


(c) (x? + y? +2") dx dy dz 


3. (a) 2xy dx + (x? + 3?) dy (e) 2x dx dy dz 


(b) —(x + y? sinx)dx dy (f) 2y dy dz — 2x dz dx 
Axy 

(c) —(2x + y) dx dy (g) xp dy 

(d) dx dy dz (h) 2xy dx dy dz 


5. (a) Form; (aV, + V2) = Form; (aA, + A, @B, + Bj, aC; + Co) 
= (@A, + An) dy dz + (a Bı + By) dz dx 
+ (aC; + C3) dx dy 
= a(A; dy dz + By dz dx + Cy dx dy) 
+ (42 dy dz + B) dz dx + Ca dx dy) 
= a Form (Vi) + Form(V2) 


aA aA A 
(b) dw = (Fax + y dy + aede) ^ dx + A(dx). 


8B 8B B 
+ (—dx+—dy+ 2P i, A dy + B(dyy 
Ox oy az 


ac ac ac 
+ dx + — dy4 dz) ^ dz + C(dzy. 
ax oy az 


But (dx)? = (dy)? = (dz = dx ^ dx = dy ^ dy = dz ^ dz «0, dy ^ dx 
— dx ^ dy, dz ^ dy = — dy ^ dz, and dx ^ dz = —dz ^ dx. Hence, 


ac OB 0A ac OB 9A 
do = (5— — dy dz 4 ( 5 — — Jazdx  ( £5 E ) ax ay 
i le de ) ae 3 i sala J ui 


= Form; (curl V). 


7. An oriented 1-manifold is a curve. Its boundary is a pair of points that may be considered 
a 0-manifold. Therefore, w is a 0-form or function, and NE m do = o(b) — oa) if the curve 
M runs from a to b. Furthermore, dw is the 1-form (dw/dx) dx + (dw/dy) dy. Therefore, 
Ju do is the line integral [,,(d@/dx) dw + (9/ dy) dy = fy Vw- ds. Thus, we obtain 
Theorem 3 of Section 7.2, fy Vo- ds = ob) — w(a). 


9. Put w = F dx dy + F; dy dz + F; dz dx. The integral becomes 


She- HE [J (CE +28 +28) nn 


(a)0 — (b) 40 


11. Consider w = x dy dz + y dz dx + z dx dy. Compute that dw = 3 dx dy dz, so that 


5 Sloe o= 5 Sl ln do = [Jy dx dy dz = vR). 
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Review Exercises for Chapter 8 


1. 


3. 


5. 


7. 


(a) 2ra? (0 

0 

(a) f—-x'/A-xy (b) —1/4 

(a) Check thatVxF=0 (b) f —3x!ycosz +C (c) 0 


23/6 


. No: V x (ax r) = 2a 


13. (a) Vf —3ye"i-3xe"j-F6xyze"k — (b) 0 (c) Both sides are 0. 

15. 87/3 

17. za?/4 

19. 21 

21. (a) G is conservative; F is not. 
(b G= Vo if ó = (x*/4) + (y*/4) - ny + 5? + C, where C is any constant. 
(c) J.E- ds=0; [,G-ds=—5; fF ds = y he- ds=—5 

23. Use (V - F)(xo, yo. Zo) = limit, o f ao, F -ndS from Section 8.4. 


Index 


2 x 2 matrix, 38, 39, 77 

3 x 3 matrix, 39, 77 

1,, 81 

e’s and 6’s limits, 121—125 

n-dimensional euclidian space, 74 

n-space vectors, 74 

R^,3 

x axis, 1 

x coordinate, 3 

x-simple regions, 347, 519 

y axis, 1 

y coordinate, 3 

y-simple domain, 407 

y-simple regions, 341, 344, 
345, 519 

zaxis, | 

z coordinate, 3 

0-form, 589, 590 

1-form, 590 

2-form, 590-592 

3-form, 592 

Bi symbol, xxviii 

A symbol, xxviii 


absolute maximum, 219-222, 
234, 235 

absolute minimum, 219—222, 
234, 235 

absolute value, xxvi 

acceleration, 261, 263-266 

action at a distance, 507 

additive inverse, 3 

adiabatic process, 450 

airplane, 37 

Alexandov, 504 

algebra of forms, 597—601 

al-Khuwarizmi, xvii 

Ampère’s law, 446, 447, 493, 549, 
550, 582 

analytic function, 202 

Andromeda galaxy, 507 
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angle between two vectors, 27-29 
angular velocity vector, 300 
anticommutativity, 597 
Appolonius, xv 
Arabian mathematics, xvii 
arc length 
definition, 274, 275, 279 
determination, 275, 276 
differential, 278, 279 
formula justification, 280, 281 
function, 279 
reparametrization, 282 
Archimedes, xvii, 398, 467, 468 
area 
curl as circulation per unit area, 
539-542 
Green’s theorem, 524 
surfaces, 461—467 
associativity, 4, 59, 83, 597 
automobile shell, 405 
average value, 428 
average value of a function, 
393-395 


Babylonian mathematics, xiii 
ball 
volume of, 390 
barn, 348 
basic 1-forms, 590 
basic 2-forms, 590 
basic 3-forms, 592 
beam, 419, 420 
Bernoulli, Jacob, 66 
Bernoulli, Johann II, 189, 204 
Bernoulli’s law, 586 
best linear approximation, 134 
binormal vector, 283 
bird, 35 
Bolyai, 85 
bordered Hessian determinant, 
239, 241 


boundary curve, 532 
boundary points, 110, 111 
bounded function, 327 
integratability, 330—332 
bounded set, 220 
brachistrochone, 429 
Buys-Ballot's law, 315 


calculus of variations, 429 
capped cylinder, 547 
Cardano, Gerolamo, 54-56 
Cartesian coordinates, 1, 2 
Cartesian product, 317 
Cauchy, Augusten-Louis, 42, 55, 
56, 228, 469 
Cauchy-Riemann equations, 483 
Cauchy-Schwartz inequality, 
29-33, 76 
Cavalieri, Bonaventura, 322 
Cavalieri's principle, 320-322 
Cayley, Arthur, 86 
center of gravity, 481 
center of mass, 93, 395-399 
centripetal force, 266 
chain rule, 150, 152, 153, 
156—158, 185, 186, 189, 227, 
249, 262, 534, 542 
first special case, 153, 154 
general case and proof, 153, 
155, 156 
second special case, 154, 155 
change of variables formula, 
368, 381 
applications, 393-404 
cylindrical coordinates, 388 
double integrals, 382, 383 
Gaussian integral, 385, 386 
polar coordinates, 383—385 
spherical coordinates, 388-390 
triple integrals, 387-390 
change of variables theorem, 376 


Index 


changing the order of integration, 
349-352 
charge density, 576 
chemical equation, 4 
circular orbit, 265—267 
circulation, 446, 539—542 
circulation and curl, 539—542 
class C! functions, 138 
class C? functions, 182 
Clifford, W. K., 421, 508 
Cobb-Douglas production 
function, 246 
closed curve, 442 
closed interval, xxvi 
closed set, 220 
closed surface, 549 
commutative, 81 
complex numbers, 55—57 
component curves, 444 
component scalar fields, 285 
composition, 120, 121 
conductivity, 287, 581 
cone, 456, 463, 473 
conformal parametrization, 
482, 514 
conic sections, xiv 
conservation of energy, 289, 290, 
581, 582 
conservation laws, 576—578 
conservative fields 
definition, 551 
gradient identification, 551—553 
physical interpretation, 
553-556 
planar case, 556—558 
constant multiple rule, 151 
constant vector field, 607 
constrained extrema, 225-232 
Lagrange multiplier method for 
several constraints, 232—235 
second derivative test, 239—243 
continuity 
of compositions, 121 
definition, 118, 119 
theorems, 137, 138 
continuous functions, 117-119 
properties, 119, 120 
coordinates, 1—3 
Copernicus, Nicolaus, xxi 


Coulomb’s law, 288, 293, 494 
Cramer's rule, 42 
critical points, 208-210, 215, 221, 
222, 226, 240 
cross product, 38, 43—47, 54, 299 
cross product rule, 262 
cross section of a beam, 419 
cross section of a torus, 472 
cross-sectional area, 320, 321 
cubic equations, 54 
curl, 539—543 
definition, 299—300 
divergence, 304 
gradients, 302, 303 
rotation association, 300, 301 
rotational flow, 301, 302 
scalar curl, 303, 304 
curvature, 425, 425, 426, 500 
hemisphere, 502—504 
planes, 501 
surfaces of constant curvature, 
504, 505 
curves, 141—144, 276 
components, 444 
integral of 1-forms over, 
593, 594 
knotted, 426 
line integrals over, 441—444 
total curvature, 425, 500 
cyclicly permuting, 44 
cyclist, 449 
cycloid, 144 
cycloidal path, 146 
cylinder, 326 
capped, 547 
cylindrical can, 244 
cylindrical hole, 473 
cylindrical coordinates, 66-68 
change of variables, 388 
Stokes’ theorem, 542, 543 
cylindrical hole, 473 


d’Alembert, LeRond, 189 
Darboux, 471 

Da Vinci, Leonardo, xx, 399 
definite integral, xvii 
degenerate critical point, 217 
Delauney, 505 

del Ferro, Scipione, 54 
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del operator, 294, 307 
density 
charge, 576 
energy, 581 
mass, 576 
derivative of a function, xvii 
derivative of a k-form, 598, 599 
derivative of a 3-form, 599 
derivative operator, 294 
derivatives 
directional derivatives, 164—166 
gradients, 136, 137 
partial derivatives, 
128-131, 135 
properties, 150—152 
Descartes, René, 83, 84 
determinants 
geometry of, 48—50 
matrix, 38, 39 
properties, 40—42, 82 
determinant test for positive 
definiteness, 215-219 
Dieterici's equation, 161 
differentiability, 130-132 
functions of two variables, 133 
general case, 134—136 
tangent plane, 133, 134 
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SYMBOLS ARE LISTED IN ORDER OF THEIR APPEARANCE IN THE TEXT 


SYMBOL 


R 
la, b] 
(a, b) 
[a, b) 
(a, b] 
lal 


axb 


y2 

ff, £44 = ff, fle») ax ay 

fff, fav = fff, for. 2 dx dy dz 
8G. y) 


= Fu, v) 
Cop 


Sofas 
JE- ds 
UNES 
ff, F - aS - ff, F - nas 


NAME 


real numbers 

closed interval (x | a < x x b) 
open interval (x | a « x « 5) 
half-open interval (x | a € x < b} 
half-open interval (x | a < x x b} 
absolute value of a 

rational numbers 

n-dimensional space 

standard basis in R? 

norm of a vector a 

inner product of the vectors a and b 
cross product of the vectors a and b 
cylindrical coordinates 

spherical coordinates 

disk of radius r about xo 

limit as x approaches xo 


left hand limit; x — b from below 


partial derivative of f with respect to x 


derivative of f at the point xo 
grad f, gradient of the function f 
continuously differentiable 
apath 

twice continuously differentiable 
Hessian of f at the point xo 

del or nabla 

div F, divergence of F 

curl F, curl of F 

Laplacian 

double integral 


triple integral 
Jacobian 

opposite path 

path integral 

line integral 

scalar surface integral 


vector surface integral 
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